Question: Filtering a vcf file with vcftools, where is my output?
gravatar for devenvyas
3.5 years ago by
University of Florida
devenvyas390 wrote:

Hello, I am a trying to filter some vcf files based off a set of rs#s ($FILE corresponds to chromosome #, so for now that is just 21, 22, and Y)

vcftools --gzvcf "sorted_AltaiNea.hg19_1000g."$FILE".mod.vcf.gz" --snps 330k.txt --out "filtered_AltaiNea.hg19_1000g."$FILE"_"

I run that code on the cluster, but then all I get out are


Which contain text as such:

VCFtools - v0.1.11
(C) Adam Auton 2009

Parameters as interpreted:
    --gzvcf sorted_AltaiNea.hg19_1000g.21.mod.vcf.gz
    --out filtered_AltaiNea.hg19_1000g.21_
    --snps 330k.txt

Using zlib version: 1.2.3
Versions of zlib >= 1.2.4 will be *much* faster when reading zipped VCF files.
Index file is older than variant file. Will regenerate.
Building new index file.
    Scanning Chromosome: 21
    Warning - file contains entries with the same position. These entries will be processed separately.

Writing Index file.
File contains 35104060 entries and 1 individuals.
Applying Required Filters.
Keeping sites by user-supplied list
After filtering, kept 1 out of 1 Individuals
After filtering, kept 5364 out of a possible 35104060 Sites
Run Time = 176.00 seconds

I do not have any vcf output, just those log files. Is there something I am doing wrong? Thanks!


snp vcftools vcf • 8.7k views
ADD COMMENTlink modified 3.5 years ago by chefer220 • written 3.5 years ago by devenvyas390
gravatar for chefer
3.5 years ago by
Pretoria, ZA
chefer220 wrote:

You should add the --recode flag to produce a new vcf file.

See example 2 on

ADD COMMENTlink written 3.5 years ago by chefer220
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1290 users visited in the last hour