Question: Annotation using annovar
0
gravatar for ivivek_ngs
5.2 years ago by
ivivek_ngs4.8k
Seattle,WA, USA
ivivek_ngs4.8k wrote:

Dear All,

I would like to address certain queries of mine. I have been using Annovar for a while now to annotate my variants from both vcf and text based format which are the output of GATK and VarScan. Off late I noticed that upno annotating I find more than 60% of the variants which are SNVs are on the intergenic and intronic regions and only few are in the exonic regions. Although my data has 75% of exon coverage. I have earlier tried to check with the how much of my reads are on the exonic regions with the Sure Select bed file that is used for the target enrichment and there I found it is well over 75% but then the variants which I find after all the statistical tests and removing the false positives I should be expecting nearly 70% lie on the exonic region after annotation right? For all my samples I have variants ranging from 200-220. These are somatic variants and out of them roughly 30% lie on the exonic region rest are in the genome_sumary.csv file of the annovar output. Is this a likely scenario or am missing out on something. Is it advisable to use some other annotation tool, if so which tool can be used directly to annotate the vcf format and text based variant format file. I would like to have some suggestions here.

 

Thanks

snp annovar • 3.0k views
ADD COMMENTlink modified 5.2 years ago by arno.guille400 • written 5.2 years ago by ivivek_ngs4.8k

Did you provide the target file when you called the variants with GATK and VarScan ?

ADD REPLYlink written 5.2 years ago by arno.guille400

Yes certainly I did use the target bed file while following the GATK pipeline, during the steps before BQSR. Infact what am doing is , am creating the realigned recalibrated bam file from GATK and then using that for variant calling for both GATK and VarScan and on this bam file already the target  enrichment file is used. I have no idea how to use target enrichment file with VarScan. Is there any way to do that in VarScan as well?

ADD REPLYlink written 5.2 years ago by ivivek_ngs4.8k
0
gravatar for arno.guille
5.2 years ago by
arno.guille400
France
arno.guille400 wrote:

You said "I have earlier tried to check with the how much of my reads are on the exonic regions with the Sure Select bed file that is used for the target enrichment and there I found it is well over 75% but then the variants which I find after all the statistical tests and removing the false positives I should be expecting nearly 70% lie on the exonic region after annotation right"

But this assumption is false, there are much more mutations in intronic regions because of selection pressure.

ADD COMMENTlink written 5.2 years ago by arno.guille400

So this is likely scenario that due to the selection the mutations would likely be more on the intronic regions, I am saying that the reads for the exome mostly corresponded to the exonic regions but then when you retrieve the variants out of them and try to reduce the variants to the more significant one and then annotate I find more in the intronic and the intergenic regions rather than the exonic regions. But should not be that the variation should be much more in the exonic regions. I am not denying the fact of the selection but then if its exome data then should not I have a fair amount of mutations on the exonic regions?

ADD REPLYlink written 5.2 years ago by ivivek_ngs4.8k

Did you provide the target file when you called the variants with GATK and VarScan ?

ADD REPLYlink written 5.2 years ago by arno.guille400

I also find a lot of significant hits in introns/mirna/intergenic regions, in the exome data i am analyzing.These could be interesting, but finding functional consequences of these are often very challenging, and people normally do not pursue such SNV further, unless you have a story in mind. 

arno.guilleBTW, i had no idea about the functionality about providing the target file when calling cariants with varscan. Which parameter did you use ?

ADD REPLYlink written 5.2 years ago by Chirag Nepal2.2k

Infact am unware of it as well as to how to use the target bed file while calling variants using VarScan

ADD REPLYlink written 5.2 years ago by ivivek_ngs4.8k

I don't use varscan but i thought there was a such option. It seems not to be the case.

ADD REPLYlink written 5.2 years ago by arno.guille400

No there is no such option in VarScan as far as I am concerned, only in case of the variant calling in GATK in BQSR you can use the target bed file and then during the Unified Genotype call or the Haplotype Calling you can do the same.

ADD REPLYlink written 5.2 years ago by ivivek_ngs4.8k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 769 users visited in the last hour