Extract species-specific SNPs from VCF files
1
0
Entering edit mode
14 months ago
GR ▴ 400

Hi All,

I have polymorphic sites for each species from different accessions in separate vcf files. ~4-5 million SNPs for each of the 5 closely related species. I need to extract species-specific SNPs from this data and was wondering if there are any tools that can be used to do this?

Thanks, RT

vcf SNPs • 483 views
ADD COMMENT
0
Entering edit mode
14 months ago
cfos4698 ★ 1.1k

If I'm interpreting you correctly, you want to compare the VCF files, ignore SNPs shared by two (or more) species, and instead retain only those SNPs specific to each species. This sounds like a job for bcftools isec.

Based on the --help information:

   # Extract records private to A or B comparing by position only
   bcftools isec A.vcf.gz B.vcf.gz -p dir -n -1 -c all
ADD COMMENT

Login before adding your answer.

Traffic: 2534 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6