Help With Genotyping In My Vcf File
1
0
Entering edit mode
10.2 years ago
Paul ★ 1.5k

Dear bio-friends,

Could you please help me how to explain genotyping in my vcf file. I have Illumina data and vcf caller was GATK. My variant frequency (Alt variant freq) is 99.7%. DP = 4622 (AD = 16, 4606) - so I would expect that this sample is alternate homozygous. But when I check PL value, which is - PL = 1655,0,323 - after calculating my likelihood -

REF= G ALT= A

P(D|GG) = 10 ^ -165.5 = small
P(D|AG) = 10 ^ 0  = 1
P(D|AA) = 10 ^ 32.3 = small

we can see it is heterozygous. Can anybody help me how to interpret my result? How it is possible that likelihoods show me heterozygous and coverage and VF show me homozygous?

Here is part of my vcf file:

chr13    32899193    .    G    A    1625.01    PASS    AC=1;AF=0.5;AN=2;DP=4622;QD=0.35;TI=NM_000059;GI=BRCA2;FC=Silent    GT:AD:DP:GQ:PL:VF:GQX    0/1:16,4606:5000:99:1655,0,323:0.997:99

Thank you for any explanation.

Paul.

vcf genotype • 3.1k views
ADD COMMENT
0
Entering edit mode

Maybe a quality issue? Someone else will probably have a better answer but given that the Genotyper takes into consideration factors beyond just the number of reads supporting a particular variant, that may be part of the issue. The allelic depth still seems extremely skewed for this to be the answer but that's all I can think of. But there doesn't seem to be any quality filters triggered at that site.

ADD REPLY
0
Entering edit mode

Yeah, true is, that other variants have much more QUAL value... But filter shows PASS and Im not sure how to explain my result.. But thanks for point to QUAL value..

ADD REPLY
0
Entering edit mode

I'd also check for Strand Bias. If a majority of the16 reference allele reads are on one strand, that could impact the genotype call. Checking this region in IGV might give you a good idea.

ADD REPLY
0
Entering edit mode

Thanks for reply.. My filtering is set up for checking Strand Biases.. So there should not be any problem with sb.

ADD REPLY
2
Entering edit mode
10.0 years ago
Paul ★ 1.5k

Hi guys,

the problem was with old version GATK - I was consulting it on GATK forum. Be careful, because all MiSeq system using this old version and you can have the same issue.

Have a good day.

ADD COMMENT

Login before adding your answer.

Traffic: 2468 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6