Identifying SNPs Contributed by the 1000 Genomes Project
9.9 years ago
Krisr ▴ 470

Hello,

I was wondering if anyone knows how to identify and retrieve a list of the SNPs that the 1000 Genomes Project contributed to the latest dbSNP build (dbSNP132)?

Thanks!

snp genome • 2.8k views

Possibly answered at this related question.

9.9 years ago
Thomas ▴ 730

Many people at Biostar recommend the ANNOVAR annotation program (http://www.openbioinformatics.org/annovar/).

With it you can also download a list of the SNPs identified by the 1000 Genomes Project, even for different builds of dbSNP, with the command: annotate_variation.pl -downdb 1000g2010 humandb/

Best, Thomas


Good point, thanks. But still, I guess you could exclude the ones without rs numbers?


The ANNOVAR tables for 1000 Genomes contain all of the 1000 Genomes data, not only the data that was submitted to dbSNP, which I think is what Krisr was asking for.


Now that sounds like a good idea. I would still look for the data in the proper repository (i.e. dbSNP), just for quality/security reasons, but processing ANNOVAR's 1000 Genomes tables (you would have to decide which releases you are interested in) and dropping the entries that have no rs ID should definitely do the job.
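The rs-number filter suggested above could be sketched roughly like this. It is only a sketch under an assumption: that the downloaded table is tab-delimited text and that the dbSNP identifier, when present, appears as its own field (for example rs followed by digits). The function names and the sample rows are made up for illustration, and the actual column layout of a given ANNOVAR release should be checked before relying on it.

```python
import re
from typing import Iterable, Iterator

# An rs identifier is "rs" followed by digits, e.g. rs12345.
RS_ID = re.compile(r"^rs\d+$")


def has_rs_id(line: str) -> bool:
    """Return True if any tab-separated field of the line is an rs identifier."""
    fields = line.rstrip("\n").split("\t")
    return any(RS_ID.match(field) for field in fields)


def keep_rs_rows(lines: Iterable[str]) -> Iterator[str]:
    """Yield only the rows that carry an rs identifier (assumed layout, see above)."""
    for line in lines:
        if has_rs_id(line):
            yield line


if __name__ == "__main__":
    # Made-up rows, not real 1000 Genomes data.
    sample = [
        "1\t1000\tA\tG\t0.10\trs111\n",
        "1\t2000\tC\tT\t0.25\n",
        "2\t3000\tG\tA\t0.05\trs222\n",
    ]
    for row in keep_rs_rows(sample):
        print(row, end="")
```

To run it on a real file, pass an open file handle to keep_rs_rows instead of the sample list. Note that this only tells you which 1000 Genomes variants have an rs ID, not who submitted them to dbSNP.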


Thanks everyone. I used the Table Browser feature of the UCSC Genome Browser. There you can select ALL SNPs from dbSNP and keep only those SNPs that were submitted by 1000GENOMES, by specifying it as a filter on "submitter".

This gave me ~ 15 million SNPs contributed from 1000GENOMES, specifically
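
If the browser filter is not an option, the same submitter filter could be applied offline to a downloaded table. The sketch below assumes a tab-delimited dump with a header row containing a column called submitters that holds a comma-separated list of submitter handles; the column name, the function name and the sample rows are assumptions for illustration, so check them against the schema of the table you actually export.

```python
import csv
import io
from typing import Dict, Iterator, TextIO


def rows_submitted_by(
    handle: TextIO,
    submitter: str = "1000GENOMES",
    column: str = "submitters",  # assumed column name
) -> Iterator[Dict[str, str]]:
    """Yield rows whose submitter list contains the given handle."""
    reader = csv.DictReader(handle, delimiter="\t")
    if reader.fieldnames is None:  # empty input
        return
    # Some dumps prefix the header line with '#'; strip it from the names.
    reader.fieldnames = [name.lstrip("#") for name in reader.fieldnames]
    for row in reader:
        raw = row.get(column) or ""
        handles = {h.strip() for h in raw.split(",") if h.strip()}
        if submitter in handles:
            yield row


if __name__ == "__main__":
    # Made-up rows, not real dbSNP data.
    dump = io.StringIO(
        "#name\tsubmitters\n"
        "rs_example_1\t1000GENOMES,\n"
        "rs_example_2\tOTHER_LAB,\n"
        "rs_example_3\tOTHER_LAB,1000GENOMES,\n"
    )
    for row in rows_submitted_by(dump):
        print(row["name"])
```

Because the rows are streamed one at a time, this should also cope with a genome-wide table that is too large to view in the browser.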


Sorry to revive an old thread, but what settings did you use in the UCSC Genome Browser to get the 15 million SNPs? When I use the entire genome and the 1000G set, I get an overflow error.

9.9 years ago

If you go to the dbSNP summary page, you will see a table describing the current data. One of its columns gives the number of new submissions. If you follow the link to the new data for Homo sapiens, you will find all the submissions made for dbSNP132, sorted by submitter batch. There are four 1000 Genomes batches there, and if you follow their links you will be able to download those batch submissions directly:

Unfortunately, these numbers only mean that the SNPs WERE submitted by 1000 Genomes, not that no one else submitted them too. So if you are looking for SNPs that ONLY the 1000 Genomes Project reported, I guess you'll have to cross-check all these results against the other submitters.
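
The cross-check could look something like the sketch below. It assumes you have already reduced the submissions to (rs ID, submitter handle) pairs, one pair per submission; how you get there depends on the files you downloaded. The function names and the sample pairs are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple


def submitters_by_snp(pairs: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Group submitter handles by rs ID."""
    grouped: Dict[str, Set[str]] = defaultdict(set)
    for rs_id, submitter in pairs:
        grouped[rs_id].add(submitter)
    return dict(grouped)


def exclusive_to(
    pairs: Iterable[Tuple[str, str]], submitter: str = "1000GENOMES"
) -> Set[str]:
    """Return rs IDs whose only submitter is the given handle."""
    return {
        rs_id
        for rs_id, handles in submitters_by_snp(pairs).items()
        if handles == {submitter}
    }


if __name__ == "__main__":
    # Made-up pairs, not real dbSNP submissions.
    pairs = [
        ("rs_example_1", "1000GENOMES"),
        ("rs_example_2", "1000GENOMES"),
        ("rs_example_2", "OTHER_LAB"),
        ("rs_example_3", "OTHER_LAB"),
    ]
    print(sorted(exclusive_to(pairs)))
```

Here rs_example_2 is dropped because another submitter also reported it, which is exactly the case the batch counts alone cannot reveal.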
