How to replicate vcftool's sliding window SNP density?
1
0
Entering edit mode
3.1 years ago
serpalma.v ▴ 70

Hello!

vcftools can calculate statistics such as pi and Fst using sliding windows.

I've noticed that windows in which the alternative allele is fixed are not taken into account. However, those appear in the VCF file.

For example, N_VARIANTS for pi at window_x is 20, but the region in the vcf file reports 25 SNPs, 5 of which are hom-alt in all samples.

I would like to find a way to calculate the SNP density as it comes in the vcf file using the same boundaries reported by vcftools.

Is there a way to make vcftools to do this?

Thanks!

SNP sequencing vcftools • 1.8k views
2
Entering edit mode
3.0 years ago
Vitis ★ 2.5k

Create a window file in BED format and use 'bedtools intersect' to intersect VCF and the window file. I believe there is an option to report number of overlaps, which in your case, would be number of SNPs in the windows. Please see: