Can't find position in GnomAD VCF
1
1
Entering edit mode
3.1 years ago
bzg ▴ 40

I have a set of 700 mutations that I need to download from GnomAD. My first problem was: the mutation rs489693 can be found in the browser but it is not present in the VCF file (gnomad.genomes.r2.1.1.sites.18.vcf.bgz).

I searched also by the position 57882787.

Why this mutation is not present in the VCF file, but can be found via browsing? I believe I am missing some important concept. I hope someone can help me.

snv GnomAD VCF • 1.1k views
ADD COMMENT
3
Entering edit mode
3.1 years ago

I've got it. in gnomad.genomes.r2.1.sites.vcf.gz (release-181127)

$ bcftools view  gnomad.genomes.r2.1.sites.vcf.gz "18:57882787" | grep -v "##" | cut -f 1,2,3

#CHROM  POS ID
18  57882787    rs489693
ADD COMMENT
0
Entering edit mode

You used the complete vcf. I used only the 18th chromosome. Maybe that is the reason.

ADD REPLY
0
Entering edit mode

No, I found it using chr18:

$ wget -q -O - "https://storage.googleapis.com/gnomad-public/release/2.1.1/vcf/genomes/gnomad.genomes.r2.1.1.sites.18.vcf.bgz" | gunzip -c | cut -f 1,2,3 | grep 57882787 -m1
18  57882787    rs489693
ADD REPLY
0
Entering edit mode

thanks Pierre for taking your time to test it! The only difference I could conceive is the way I uncompressed the file. I used the nautilus utility. I realized that the size is much smaller (73Gb) while uncompressing with gunzip (100Gb).

ADD REPLY

Login before adding your answer.

Traffic: 2157 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6