Question: Only the longest transcript while annotating with Annovar?
0
gravatar for mhmtgenc85
13 months ago by
mhmtgenc8530
Turkey
mhmtgenc8530 wrote:

Hi everyone, I was wondering if I can get only the longest trancript in the corresponding column of the Annovar annotation? -onetranscript argument chooses randomly but not the longest one.

So could you please help me? Thanks in advance

ADD COMMENTlink modified 13 months ago by Kevin Blighe39k • written 13 months ago by mhmtgenc8530
4
gravatar for Kevin Blighe
13 months ago by
Kevin Blighe39k
Republic of Ireland
Kevin Blighe39k wrote:

The answer from the author of ANNOVAR is this:

There has never been a consensus in the field which transcript should be used to represent a gene when multiple transcripts are available. The most popular approach is to use the longest transcript nowadays. However, in the medical genetics field, for certain specific diseases and specific genes, there are 'canonical' transcripts that everybody uses by default for historical reasons, and you will need to manually select this canonical transcript from ANNOVAR output file to communicate with the rest of the field.

[source: http://annovar.openbioinformatics.org/en/latest/misc/faq/]

In a way, he is correct, and I feel that the field should start to embrace (and report) multiple transcript isoforms more and more, even with the increased data load. There is too much reporting of variants on isoforms that may have minimal relevance in the tissue of study. Also, for many well-studied genes, like BRCA1, we have identified >10 isoforms; whilst, for other less-studies genes, we don't yet understand the alternate splicing patterns of the gene.

Note that VEP does allow you to output the canonical isoform, but to Ensembl the canonical is always the isoform with the longest CCDS: https://www.ensembl.org/Help/Glossary?id=346

On the last point, researchers even disagree about what canonical means. For some it is the highest expressed isoform in the tissue being studied, which may not necessarily be the longest. At least Ensembl's definition is broad-sweeping and covers all tissues.

Kevin

ADD COMMENTlink written 13 months ago by Kevin Blighe39k

I am more than happy and satisfied with your reply. So I'll keep all transcripts to be sure not to miss anything.

Thank you Kevin.

ADD REPLYlink written 13 months ago by mhmtgenc8530

No problem mhmtgenc85

ADD REPLYlink modified 12 months ago • written 13 months ago by Kevin Blighe39k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2522 users visited in the last hour