Variant Missing in VCF but Present in BAM Files: Discrepancy Identified via IGV Report in Join-Call Project
1
2
Entering edit mode
6 months ago
lunping98 ▴ 20

While following GATK's best practices for joint variant calling in a population, I noticed that some variants were missing in certain samples. However, when I checked the BAM files with IGV report, these samples actually contained the variants. For more details, please refer to the B.html file.igv-report

GATK Deepvariant BWA • 973 views
ADD COMMENT
0
Entering edit mode

How many reads carrying tthe REF allèle do you have at the very same location ?

ADD REPLY
0
Entering edit mode

Here are the reads carring the ref allele

chr1:3,044,701

Total Count 57

A 0

C 8 (14%, 6+, 2- )

G 49 (86%, 27+, 22- )

T 0

N 0

ADD REPLY
0
Entering edit mode

So there is only 14% of reads carrying the alt allele. May be your caller is too stringent.

ADD REPLY
2
Entering edit mode
6 months ago
GokalpC ▴ 170

It could be due to an assembly failure at the very same location. Can you bring up bamout files from those locations and check if the assembly has ever been there?

If not you can try different options to recover

--recover-all-dangling-branches true
--max-reads-per-alignment-start 0
--linked-debruijin-graph
--adaptive-pruning true

and finally

--use-pdhmm true
--pileup-detection true
--pileup-detection-enable-indel-pileup-calling true

Last 3 parameters are not compatible with GVCF output if that is what you are interested for joint calling. But you may enable them to diagnose your case.

Also check if those missing sites have call confidence of at least 30. GenotypeGVCFs default parameters do not emit sites that have GQ less than 30. You may wish to reduce that threshold to 10 to see if those sites get emitted.

Use version 4.6.1.0.

Regards.

ADD COMMENT

Login before adding your answer.

Traffic: 4527 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6