Problem understanding GT from Varscan's vcf output
0
0
Entering edit mode
5.5 years ago
ww22runner ▴ 60

Hello all,

I am trying to look for somatic variants in paired tumor/normal samples and have been using Varscan's somatic and processSomatic commands to obtain a vcf. However, I am having trouble understanding the output.

For somatic variants called with high confidence, an example is:

> #CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT  NORMAL  TUMOR 
>chr10   42400631        .       C       T       .      PASS    DP=41;SOMATIC;SS=2;SSC=14;GPV=1E0;SPV=3.2237E-2    GT:GQ:DP:RD:AD:FREQ:DP4 1/1:.:26:24:2:7.69%:12,12,2,0   0/1:.:15:10:5:33.33%:9,1,3,2

In this example, when I look at the normal sample, the DP value is 26 out of which 24 match the reference allele while 2 do not and thus the allele frequency of 7.69%. According to my understanding, Varscan's default parameter for min-freq-for-hom is 0.75 meaning if in this case the reference allele occurs in (100-7.69)% = 92.31% > 75%, the GT for normal sample should be 0/0 instead of 1/1?

If anyone could kindly help me understand such vcf outputs, I would greatly appreciate it.

Also, could anyone tell me how processSomatic command separates certain somatic variant calls as high confidence and what conditions it uses to do this?

Thank you

varscan • 1.3k views
ADD COMMENT

Login before adding your answer.

Traffic: 1507 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6