Question: gatk SelectVariants odd behavior
13 months ago, jon.klonowski wrote:

gatk SelectVariants is not working properly for me...

I have a file of SNP IDs (e.g. chr1:861219:G:C), one per line (delimited with \n), and a VCF in which the SNP ID is also formatted chr:pos:ref:alt (chr1:861219:G:C).

The list of SNPs was generated by subsetting the VCF's SNPs (based on 1000G frequency). When I try:

gatk SelectVariants --variant File_name.vcf.gz -O Output_file_name.vcf.gz --keep-ids SNPS_of_interest.txt

I get a file of just the headers.

and then when I try:

 gatk SelectVariants --variant File_name.vcf.gz -O Output_file_name.vcf.gz --exclude-ids SNPS_of_interest.txt

The output is the same as the input, apart from a difference of ~10 lines in zless | wc -l.
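Before rerunning, it can help to confirm that the IDs in the list really do occur in the ID column of the VCF, so that a formatting mismatch can be ruled out. The following is a minimal sketch, not part of GATK: the function name count_matching_ids is hypothetical, and it assumes a tab-delimited VCF compressed with gzip or bgzip and a list with exactly one ID per line.

```shell
# count_matching_ids LIST VCF_GZ
# Prints the number of VCF records whose ID (column 3) appears in LIST.
count_matching_ids() {
  # $1: plain-text ID list, one ID per line; $2: gzip/bgzip-compressed VCF
  gzip -dc "$2" | awk -F'\t' '
    NR == FNR { want[$0] = 1; next }      # first input: the ID list
    !/^#/ && ($3 in want) { n++ }         # VCF records whose ID is listed
    END { print n + 0 }
  ' "$1" -
}

# Usage with the file names from the question:
# count_matching_ids SNPS_of_interest.txt File_name.vcf.gz
```

If the count is close to the number of lines in the list, the IDs themselves are fine and the problem is more likely in how the file is passed to gatk.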


You may want to post on the GATK Support Forum.

written 13 months ago by Kevin Blighe
6 months ago, cricgpu wrote:

Although it is not explicitly stated in the gatk SelectVariants docs, make sure your filtering files have the suffix .list. In your example, change SNPS_of_interest.txt to SNPS_of_interest.list. This also holds true for other filters, e.g. --keepIDs.
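A likely explanation, stated here as an assumption rather than something confirmed in this thread, is that gatk only expands a file into its individual entries when the name ends in .list, and otherwise treats the file name itself as a single ID. That would match both symptoms in the question. The renaming step can be sketched as below; the helper name to_list_file is hypothetical, and it assumes the input file name has an extension to replace.

```shell
# to_list_file PATH
# Copies PATH to a sibling file with its extension replaced by .list
# and prints the new file name. The original file is left untouched.
to_list_file() {
  list_path="${1%.*}.list"
  cp -- "$1" "$list_path" && printf '%s\n' "$list_path"
}

# Usage (needs gatk on PATH; file names and flags are those from the question):
# ids=$(to_list_file SNPS_of_interest.txt)
# gatk SelectVariants --variant File_name.vcf.gz -O Output_file_name.vcf.gz --keep-ids "$ids"
```

Copying rather than renaming keeps the original .txt file available for comparison.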

Hope this helps!
