Question: vcf not indexing
gravatar for alex
3.8 years ago by
United States
alex180 wrote:

Trying to index vcf file but getting the following

tabix -p vcf dbsnp_138.hg19.vcf.gz
Not a BGZF file: dbsnp_138.hg19.vcf.gz
tbx_index_build failed: dbsnp_138.hg19.vcf.gz


Thoughts on how to proceed?  Thanks!

tabix • 6.5k views
ADD COMMENTlink modified 3.8 years ago by Sean Davis25k • written 3.8 years ago by alex180
gravatar for Sean Davis
3.8 years ago by
Sean Davis25k
National Institutes of Health, Bethesda, MD
Sean Davis25k wrote:

Looks to me like the dbsnp file is not bgzipped?  

gunzip dbsnp_138.hg19.vcf.gz
bgzip dbsnp_138.hg19.vcf
tabix -p vcf dbsnp_138.hg19.vcf.gz
ADD COMMENTlink written 3.8 years ago by Sean Davis25k

@Sean Davis you just saved me a lot of frustration. I found this after a few searches and it worked. THanks

ADD REPLYlink written 20 months ago by jespinoz20

I had the same problem: when I compressed with gunzip <file>.vcf tabix gave the error: tbx_index_build failed:<file>.vcf.gz but it worked with `bgzip <file>.vcf.

ADD REPLYlink written 3 months ago by marongiu.luigi350

Of course it does. bgzip does blockwise (therefore the b in bgzip) compression of the file, which tabix relies on. That enables tabix to quickly retrieve data by only very partially decompressing a (sometimes hugh) file, guided by the index. gzip does not do blockwise compression.

ADD REPLYlink modified 3 months ago • written 3 months ago by ATpoint12k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 658 users visited in the last hour