Filtering Vcf File For Fixed Substitutions
2
1
Entering edit mode
8.8 years ago
Rubal7 ▴ 800

Hello Everyone,

I have 2 VCF files for 2 separate populations and I would like to look for candidate functional sites that are fixed in each population and not present in the other VCFs. Ensembls variant effect predictor (VEP) can take VCF files directly. So is there a straightforward way to create a new VCF from each population VCF that only contains sites where all individuals carry the alternative allele (1/1)? I can't see an option on VCFtools that does this. This isnt exactly what I want but close enough I think to be very useful. Ideally I want a vcf file for a population containing only sites where they are fixed for alternate (1/1) and the other population is fixed for the reference allele (0/0). I think with VCFtools I can filter one file based on another, but not sure how to do the filtering to specify keep sites in one file if all individuals are 1/1 and all individuals in other file are 0/0. In other words only keeping sites with substitutions. Maybe there is a way in VCFtools I don't see.

Thanks in advance for your help!

Best regards,

Rubal

vcf filter allele frequency • 4.0k views
ADD COMMENT
3
Entering edit mode
8.8 years ago
Gabriel R. ★ 2.8k

Maybe you could use http://www.broadinstitute.org/gatk/gatkdocs/org_broadinstitute_sting_gatk_walkers_variantutils_CombineVariants.html to merge your multiple VCF files and run a some sort of grep for non-1/1 genotypes.

ADD COMMENT
3
Entering edit mode
8.8 years ago
Adam ★ 1.0k

In VCFtools, can't you just use the --non-ref-af and --max-non-ref-af options on each file separately, and then intersect the results?

ADD COMMENT
1
Entering edit mode

ah that makes sense, thank you!

ADD REPLY

Login before adding your answer.

Traffic: 2031 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6