Is there a ready list of all SNPs in UCSC?
4
0
Entering edit mode
9.9 years ago
m321.k321 • 0

I have list of few thousands snps name (RS*) and want to know their MAF.

I used the link http://genome.ucsc.edu/cgi-bin/hgTables

But i get errors that not all the SNPs are known.

This is why first I want to check which SNP exist in UCSC.

How can I do it? Is there a list of all known snps in the UCSC.

If there is such list that I can download, I will make a short code in phyton to compare my SNPs.

Thanks

SNP • 4.0k views
ADD COMMENT
4
Entering edit mode
9.9 years ago

Here's a one-liner to get a list of UCSC SNP names:

$ mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A -D hg19 -e 'SELECT name FROM snp138Common' | tail -n +2 > snp138CommonNames.txt

If you want positional data, etc. there are other fields available in the snp138common table schema.

For example, if you want to know the type of functional variant for a given SNP name, add the func field to the query:

$ mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A -D hg19 -e 'SELECT name, func FROM snp138Common' | tail -n +2 > snp138CommonNamesAndVariantTypes.txt
ADD COMMENT
3
Entering edit mode
9.9 years ago
Asaf 10k

In the table browser (the link you posted) choose: group:Variation -> track:all SNPs(138) and get the output, the fifth columns is the SNP name rs*.

ADD COMMENT
0
Entering edit mode

Hi Asaf,

each time that i try to run this with all SNPs the chrome is crashing...

ADD REPLY
1
Entering edit mode

solution: don't use chrome.

ADD REPLY
2
Entering edit mode
9.9 years ago
Emily 23k

Or, more easily, use Ensembl BioMart. Use the short variation database, filter by a list of IDs and get the MAF as an attribute. No coding involved.

ADD COMMENT
0
Entering edit mode
9.9 years ago
m321.k321 • 0

Emily_Ensembl,

when you say "get the MAF as an attribute" - fo you mean the 1000 genomes global MAF (ALL)?

Because I understood that the UCSC is the most accurate database and the MAF there is the most accurate?

ADD COMMENT
0
Entering edit mode

Yes, the 1000 genomes MAF will be good.

The dbSNP data in UCSC and Ensembl all comes from the same place - one possible difference is that Ensembl carry out an extensive QC process to get rid of anything that looks dodgy, whereas I believe UCSC lift it directly.

ADD REPLY
0
Entering edit mode

Thanks.

Can I get more info of the snp? like name of the gene or intergenic? inside intron or exon?

:)

ADD REPLY
0
Entering edit mode

Yep, that's all under Gene associated information.

ADD REPLY
0
Entering edit mode

I couldn't find gene name. can you help me to find this attribute? thanks

ADD REPLY
0
Entering edit mode

I'm afraid we don't have the associated gene name for variants. I'll speak to our team to see if we can get that added. In the meantime, you can get the ID then use the genes database to convert it.

ADD REPLY

Login before adding your answer.

Traffic: 3154 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6