Using snpsift to filter VCF by sample
0
0
Entering edit mode
2.0 years ago
curious ▴ 750

I am trying to identify controls in a VCF containing 10K samples and 40 variants. these controls should not carry any alternate alleles at any of the 40 variants.

based on the snpsift docs it look like I should be able to do something like this to keep sample homozygous reference for all variants in

cat $sample.vcf | java -jar SnpSift.jar filter "isHom( GEN[*] ) & & isRef( GEN[*] )"

I still get 10K samples output, which I know should not be the case as some sample have alternate alleles for at least one of the 40 variants

Ive tried to filter a VCF by sample with snpsift before, but have never been able to crack it. I can do this in python but usually avoid custom coding VCF stuff if possible. any help is greatly appreciated.

snpsift • 502 views
ADD COMMENT
2
Entering edit mode

still get 10K samples output,

the tool filters variants, not samples.

ADD REPLY

Login before adding your answer.

Traffic: 2695 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6