Question: Differences between Ensembl GTF files and GFF3 files
3.8 years ago, oma219 wrote:

Is the gene_id attribute in GTF files analogous to the Name attribute in a .gff3 file? Both featureCounts and htseq-count use gene_id as their default ID attribute, but I have a .gff3 file, so I'm trying to figure out what I should change it to. Thanks.

sequencing • 1.7k views

GTF Format reference from Ensembl. GFF3 format reference from GMOD.

Comment written 3.8 years ago by GenoMax
3.8 years ago, dariober (WCIP | Glasgow | UK) wrote:

Correct me if I'm wrong, but it may not be straightforward to apply htseq-count to gff3 (I don't know about featureCounts). For typical differential gene expression, htseq-count needs an attribute that groups all the exons belonging to the same gene. In a gtf file, gene_id is usually the right one, but I don't think gff3 necessarily has such an attribute.
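To check this on a given file, a small script can list which attribute keys actually appear on each feature type, and show which gene each exon resolves to through its Parent chain. The sketch below is only an illustration: the records are made up in an Ensembl-like style, the function names are hypothetical, and it assumes each feature has a single Parent (GFF3 allows several, comma-separated; only the first is followed here).

```python
from collections import defaultdict


def parse_attributes(column9):
    """Parse a GFF3 attribute column ('key=value;key=value') into a dict."""
    attrs = {}
    for field in column9.strip().split(";"):
        if "=" in field:
            key, value = field.split("=", 1)
            attrs[key.strip()] = value.strip()
    return attrs


def feature_rows(lines):
    """Yield (type, start, end, attributes) for each 9-column feature line."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) == 9:
            yield cols[2], int(cols[3]), int(cols[4]), parse_attributes(cols[8])


def attribute_keys_by_type(lines):
    """Map each feature type (column 3) to the set of attribute keys seen on it."""
    keys = defaultdict(set)
    for ftype, _start, _end, attrs in feature_rows(lines):
        keys[ftype].update(attrs)
    return dict(keys)


def exon_to_gene(lines):
    """Return (start, end, top-level ancestor ID) for every exon.

    Simplification (assumption): only the first Parent of a feature is followed.
    """
    parent_of = {}  # feature ID -> first Parent ID, or None for top-level features
    exons = []
    for ftype, start, end, attrs in feature_rows(lines):
        parent = attrs.get("Parent", "").split(",")[0] or None
        if "ID" in attrs:
            parent_of[attrs["ID"]] = parent
        if ftype == "exon":
            exons.append((start, end, parent))
    resolved = []
    for start, end, node in exons:
        seen = set()  # guards against accidental cycles in malformed files
        while node is not None and parent_of.get(node) is not None and node not in seen:
            seen.add(node)
            node = parent_of[node]
        resolved.append((start, end, node))
    return resolved


# Made-up records for illustration only; not taken from a real annotation.
example = [
    "##gff-version 3",
    "1\tsrc\tgene\t100\t900\t.\t+\t.\tID=gene:G1;Name=abc1",
    "1\tsrc\tmRNA\t100\t900\t.\t+\t.\tID=transcript:T1;Parent=gene:G1",
    "1\tsrc\texon\t100\t300\t.\t+\t.\tParent=transcript:T1;Name=E1",
    "1\tsrc\texon\t500\t900\t.\t+\t.\tParent=transcript:T1;Name=E2",
]

keys = attribute_keys_by_type(example)
print(sorted(keys["exon"]))
print(exon_to_gene(example))
```

In this toy file the exon lines carry only Parent (pointing at the transcript) and a per-exon Name, so neither attribute groups exons by gene directly; the gene is only reachable by walking up the Parent chain. On a real file, if some attribute on the exon lines does identify the gene, that is the one to pass to htseq-count via `--idattr` (or to featureCounts via `-g`).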

See also How to modify a gff3 file for HTSeq?
