Question: vcf not indexing
gravatar for alex
4.3 years ago by
United States
alex180 wrote:

Trying to index vcf file but getting the following

tabix -p vcf dbsnp_138.hg19.vcf.gz
Not a BGZF file: dbsnp_138.hg19.vcf.gz
tbx_index_build failed: dbsnp_138.hg19.vcf.gz


Thoughts on how to proceed?  Thanks!

tabix • 8.3k views
ADD COMMENTlink modified 4.3 years ago by Sean Davis25k • written 4.3 years ago by alex180
gravatar for Sean Davis
4.3 years ago by
Sean Davis25k
National Institutes of Health, Bethesda, MD
Sean Davis25k wrote:

Looks to me like the dbsnp file is not bgzipped?  

gunzip dbsnp_138.hg19.vcf.gz
bgzip dbsnp_138.hg19.vcf
tabix -p vcf dbsnp_138.hg19.vcf.gz
ADD COMMENTlink written 4.3 years ago by Sean Davis25k

@Sean Davis you just saved me a lot of frustration. I found this after a few searches and it worked. THanks

ADD REPLYlink written 2.2 years ago by jespinoz20

I had the same problem: when I compressed with gunzip <file>.vcf tabix gave the error: tbx_index_build failed:<file>.vcf.gz but it worked with `bgzip <file>.vcf.

ADD REPLYlink written 9 months ago by marongiu.luigi380

Of course it does. bgzip does blockwise (therefore the b in bgzip) compression of the file, which tabix relies on. That enables tabix to quickly retrieve data by only very partially decompressing a (sometimes hugh) file, guided by the index. gzip does not do blockwise compression.

ADD REPLYlink modified 9 months ago • written 9 months ago by ATpoint19k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 865 users visited in the last hour