Question: 1000 Genomes Snp Annotation
1
gravatar for Ryan D
6.6 years ago by
Ryan D3.3k
USA
Ryan D3.3k wrote:

Since so many new SNPs are present in the 1000 Genomes data and latest builds of dbSNP, a large number of websites such as F-SNP and the SNP Info Web Server's function prediction module do not have annotation predicting the function of said SNPs. Going beyond things like PolyPhen or SIFT estimates of synonymous/non-synonymous intronic/intergenic, is there an integrated pipeline which can give details on things like: Is the SNP in a TF binding site?

Splice site?

miRNA binding site?

Conserved region?

Other regulatory region?

eQTL for genes?

The last of these is unlikely. F-SNP only runs up through dbSNP build 126 and the SNP Infor Webserver only covers later HapMap builds?

What is the latest integrated pipeline for annotating SNP function for 1000 Genomes variants with rs numbers? Is there one? Or must one use a collection of tools? What do those of you with a long list of SNPs from the more recent dbSNP builds do to annotate these things?

annotation snp • 4.0k views
ADD COMMENTlink written 6.6 years ago by Ryan D3.3k
4
gravatar for Jorge Amigo
6.6 years ago by
Jorge Amigo11k
Santiago de Compostela, Spain
Jorge Amigo11k wrote:

there are several annotation tools out there, publicly available to perform all these needs you have: snpEff, Variant Effect Predictor, SeattleSeq Annotation, ANNOVAR,... please refer to this previous question to get a broad description of some of the previously mentioned.

as a straight answer, I would definitely go for ANNOVAR, since it already has an embeded pipeline which provides you with all the things you are asking for, and generates a single csv excel readable file. the script is called summarize_annovar.pl, and it works pretty well, since it queries more than 10 locally previously downloaded databases and summarizes them all into a very convenient single file.

ADD COMMENTlink written 6.6 years ago by Jorge Amigo11k
1
gravatar for Laura
6.6 years ago by
Laura1.7k
Cambridge UK
Laura1.7k wrote:

The 1000genomes project provides coding and non coding annotation for its variants from the phase1

You can find both the annotation used and the annotated vcfs on the ftp site

ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase1/analysis_results/functional_annotation/

ADD COMMENTlink written 6.6 years ago by Laura1.7k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1083 users visited in the last hour