Vg Call, not calling SNP in all samples
Entering edit mode
19 months ago


I have been trying to use Vg call to call variants from pooled samples. I am interested in using it for indels but I first compared the vg call frequency for two biaalelic SNPs whose frequency was also estimated using a different method (PoolSNP). The estimates from PoolSNP seem quite reliable but only for biallelic SNPs. Vg call produces similar AF for the variants it has called, see plots below. However, it does not call the variant in most samples, again in plot. The minimum sample depth in the PoolSNP calls for both sites is >40. My feeling it has something to do with the fact that Vg call work on samples individually without joint calling. I also attached my approach below, I am using vg-v1.26.1.

enter image description here

vg map -x x.xg -g x.gcsa -f 1.fq.gz  -f  2.fq.gz -t 32 -Z 100 > mapped.gam

vg gamsort mapped.gam -t 32  -i mapped_sorted.gam.gai >mapped_sorted.gam

vg chunk -x x.xg -c 10 -p 3:3716672-3718990 -g -a mapped_sorted.gam -O pg > 2>chunk_err

vg augment chunk_0_3_3716352_3719336.gam -s -A chunk_1_aug.gam >

vg snarls > chunk_1_aug.snarls

vg pack -x -g chunk_1_aug.gam -o chunk_1_aug.pack

vg call -a -r chunk_1_aug.snarls -k chunk_1_aug.pack -s sample > sample_calls.vcf

I only show for two SNPs but this happens for many others as well and it never the same samples that are called by Vg call. Please let me know if you have any insights into what is going on. Thanks so much for your time, Arun.

Vg call vg tools vg • 726 views

Login before adding your answer.

Traffic: 2417 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6