Ensembl genome annotation versions
1
0
Entering edit mode
7.2 years ago
ofonov • 0

I am running a GATK pipeline on RNA-seq data. For the alignment I used the genome from the GATK bundle, human_g1k_v37_decoy.fasta. Now I would like to use the alignment files with featureCounts to obtain TPMs, which requires a GTF annotation file. However, the GATK bundle does not include an annotation.

Which version of the annotation should I use? Will it be OK to use the latest version of the Ensembl annotation - GRCh37 release 87? Or should I go some versions down?

Annotation Ensembl Genome • 2.0k views
7.2 years ago
Emily 23k

The latest data is always the most up-to-date and highest quality.


Thank you for the reply. Are Ensembl annotations based on the primary assembly (e.g. GRCh37) or on the patched assemblies? Are there any differences between annotation versions other than the updated features?


They're on the patched assemblies, which will be the difference.


My concern was that if the genome version I used for the alignment is earlier than the one on which the latest annotation is based, this might lead to artefacts, since some features might have moved because of the applied patches. Perhaps I am overcomplicating things?
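
One way to test this concern in practice is to compare the sequence names in the genome index with those in the GTF before counting. The snippet below is only a rough sketch, not a validated tool: it assumes a samtools-style .fai index for the FASTA and a tab-separated GTF, the function names are made up for illustration, and the toy lines (including the scaffold name PATCH_SCAFFOLD_X and the lengths) are invented, not taken from a real release.

```python
def fasta_index_contigs(fai_lines):
    """Collect contig names from the lines of a .fai index (first column)."""
    return {line.split("\t")[0] for line in fai_lines if line.strip()}


def gtf_contigs(gtf_lines):
    """Collect sequence names from GTF lines, skipping '#' header lines."""
    names = set()
    for line in gtf_lines:
        if line.startswith("#") or not line.strip():
            continue
        names.add(line.split("\t")[0])
    return names


def compare_contigs(fai_lines, gtf_lines):
    """Report which sequence names are shared and which appear on one side only."""
    genome = fasta_index_contigs(fai_lines)
    annotation = gtf_contigs(gtf_lines)
    return {
        "shared": sorted(genome & annotation),
        "only_in_genome": sorted(genome - annotation),
        "only_in_annotation": sorted(annotation - genome),
    }


# Toy input (invented values); with real files, pass open file handles instead,
# e.g. open("genome.fasta.fai") and open("annotation.gtf") (placeholder names).
toy_fai = [
    "1\t1000\t52\t60\t61\n",
    "decoy_contig\t500\t1100\t60\t61\n",
]
toy_gtf = [
    "#!toy header line\n",
    '1\thavana\tgene\t10\t200\t.\t+\t.\tgene_id "GENE_A";\n',
    'PATCH_SCAFFOLD_X\thavana\tgene\t5\t80\t.\t-\t.\tgene_id "GENE_B";\n',
]

report = compare_contigs(toy_fai, toy_gtf)
for key, names in report.items():
    print(key, names)
```

Features on sequences listed under only_in_annotation cannot receive any reads from this alignment, and names under only_in_genome simply have no annotated features. Note that this check compares names only; it does not verify that coordinates on the shared sequences agree.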
