Long variant splits to several variants by variant callers
0
1
Entering edit mode
2.3 years ago
avichaireich ▴ 10

Hello. I encountered an issue while interperting human variants. In some cases varinats that in IGV are seen as one long variants are split to several variants in the vcf file produced by GATK4 or other variant callers.

Here is an example:

Screenshot of IGV and GATK4 outputThe IGV clearly shows a deletion of 11 bases in the sequence, but in the output of GATK4 (in the right), this variant is split to two adjacent variants.

Another example that is a bit different: Another screenshot of IGV and GATK4 output The IGV shows a deletion of 2 bases in the sequence and a substitution of one base. These variants are reported as two variants in the output of GATK4 (in the right), but if I compare between the WT and mutent sequences, I get a single variant of delTTTinsA:

WT TTTCCGAAGCGAACCTCTCTGCCTTTTGTTTCTTTCTTTCGCTTCTTTTTCACTTTATTTTTTTCCTTCTGCTGCTTGGCATTAGAA

Mut TTTCCGAAGCGAACCTCTCTGCCTTTTGTTTCTTTCTTTCGCTTCTTTTTCACTTTAATTTTCCTTCTGCTGCTTGGCATTAGAA

Why GATK interpretate variants in this way? Thank you.

Variant calling GATK4 Freebays • 687 views
ADD COMMENT
0
Entering edit mode

Hi, did you figure out what is causing this split?

ADD REPLY
0
Entering edit mode

To troubleshoot that sort of thing, you need to re-run HaplotypeCaller with the -bamout argument so you can see what it was "thinking". If the call still doesn't make sense, make a post in the GATK forum so the GATK support team can look at it. Good luck!

ADD REPLY

Login before adding your answer.

Traffic: 1733 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6