How to replicate vcftool's sliding window SNP density?
1
0
Entering edit mode
4.6 years ago
serpalma.v ▴ 80

Hello!

vcftools can calculate statistics such as pi and Fst using sliding windows.

I've noticed that windows in which the alternative allele is fixed are not taken into account. However, those appear in the VCF file.

For example, N_VARIANTS for pi at window_x is 20, but the region in the vcf file reports 25 SNPs, 5 of which are hom-alt in all samples.

I would like to find a way to calculate the SNP density as it comes in the vcf file using the same boundaries reported by vcftools.

Is there a way to make vcftools to do this?

Thanks!

SNP sequencing vcftools • 2.5k views
ADD COMMENT
2
Entering edit mode
4.6 years ago
Vitis ★ 2.5k

Create a window file in BED format and use 'bedtools intersect' to intersect VCF and the window file. I believe there is an option to report number of overlaps, which in your case, would be number of SNPs in the windows. Please see:

https://bedtools.readthedocs.io/en/latest/

ADD COMMENT

Login before adding your answer.

Traffic: 2645 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6