Question: Annovar gene and cosmic variant annotations don't match
0
gravatar for jacobsen.jeremy
4.6 years ago by
United States
jacobsen.jeremy40 wrote:

I ran GATK variant discovery tools on my bam files and was interested in using the output to see how the variants may be affecting protein production. Additionally I wanted more information on what transcripts the variants can be found in.  I formatted the GATK file by removing the header and adding a column for the stop locus.  Then I ran annovar with:

perl annotate_variation.pl -geneanno -buildver hg19 gatk_files/... gatk_files/...

The script runs and leaves me with a gene annotated file called ..exonic_variant_function.  The problem with the annovar output is that the cosmic variants identifiers don't align with the gene.  For instance the first line in the file is:

line262 synonymous SNV NOC2L:NM_015658:exon16:c.C1843T:p.L615L, 1 881627 881627 G A

A cosmic search shows that p.L615L is a variant in KIAA1755 and not in NOC2L.  My end goal is to alter the RNA sequence for the transcripts where the variant is found and to code for variant modified proteins in order to build a modified peptide centric database.  From what I understand, the cosmic accession number gives the location of the variant from the start site which could make this process easy... if annovar was reporting on the correct variant->transcript. 

Any ideas??

 

Thanks,

Jeremy

 

 

rna-seq snp forum gatk • 2.0k views
ADD COMMENTlink modified 4.6 years ago • written 4.6 years ago by jacobsen.jeremy40
0
gravatar for karl.stamm
4.6 years ago by
karl.stamm3.5k
United States
karl.stamm3.5k wrote:

so KIAA1755 is on chr20 and NOC2L is on chr1. Your variant is on chr1, so it seems that everything is okay from ANNOVAR.

You mentioned a COSMIC ID# but haven't given one here. pL615L is not an ID#, but a mutation, Leu to Leu. Any protein whose 615th AA is Leu can have a 615L to L mutation. There's probably a lot of them, so you can't just google p.L615L and expect it to refer to the same gene.  Your transcript here is NM_015658, whose 615th AA should be Leucine (seen at Uniprot: http://www.uniprot.org/uniprot/Q9Y3T9 ) . 

KIAA1755 is not involved. 

ADD COMMENTlink modified 4.6 years ago • written 4.6 years ago by karl.stamm3.5k
0
gravatar for jacobsen.jeremy
4.6 years ago by
United States
jacobsen.jeremy40 wrote:

You are absolutely right Karl.  I allowed myself to be led astray by the siren calls of Google.  

Thanks,

Jeremy

ADD COMMENTlink written 4.6 years ago by jacobsen.jeremy40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1379 users visited in the last hour