Question: Question about dbSNP rs #s
6.2 years ago, devenvyas (Stony Brook) wrote:

Summarized version:

I have an old data set from 2008 from a set of HumanCNV370-Quad arrays, and I recently downloaded a set of extended VCFs from the Altai Neanderthal and Denisovan genomes. I want to compare the two data sets. I know the genome coordinates for any given base can shift from assembly to assembly, but will the rsID for a given SNP change if and when its coordinate changes?


In-depth version:

I have SNP data from 64 samples at ~330,000 rsIDs (I know there is no mt/Y data, and I am pretty sure it is all autosomal). The data come from an old set of HumanCNV370-Quads from 2008. I don't have the genomic coordinates.

I have downloaded two sets of VCF files from the Denisovan 30× and Altai Neanderthal 50× coverage genomes (available here and here). These files are in a cumbersome extended VCF format described here (page 16) and here (page 14). The files have rsIDs labeled for most sites (though of course not all sites in these genomes have been assigned rsIDs).

I also have Illumina data from 171 samples (the libraries were enriched for NRY and mtDNA), which I now have as raw, unfiltered VCFs without rsIDs. I am trying to bring these into the mix, but I am going to ignore them for now (I have a thread on them here).

For the 330k sites, I have the common chimpanzee alleles from 1000G (phase1_release_v3/20101123) and dbSNP build 141 for most of the sites, obtained from a friend of a friend. The goal is to use f4 statistics to calculate Neanderthal ancestry estimates.

Anyway, to my question: would the rsIDs from the SNP chip still correspond to the rsIDs that I find in the extended VCF files? If not, what would I have to do to make them match up?


snp dbsnp coordinates rs genome • 2.2k views
6.2 years ago, Katie D'Aco wrote:

rsIDs stay the same from genome build to genome build, even if the genomic coordinates change. The one gotcha I can think of in your plan is an rsID that has since been removed from dbSNP or merged into another rsID.
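
To illustrate the merge gotcha, here is a minimal Python sketch of matching chip rsIDs against the IDs found in the VCFs. It assumes you have already built a retired-to-current lookup from dbSNP's merge history (for example, the RsMergeArch table in older dbSNP database dumps; check the current release for its equivalent). The function names, the dictionary layout, and the rsIDs in the example are all made up for illustration and are not part of any dbSNP tool.

```python
def resolve_rsid(rsid, merge_map):
    """Follow retired -> current links until the ID is no longer listed as retired.

    merge_map is an assumed dict of {retired_rsid: rsid_it_was_merged_into}.
    The `seen` set guards against a malformed table that contains a cycle.
    """
    seen = set()
    while rsid in merge_map and rsid not in seen:
        seen.add(rsid)
        rsid = merge_map[rsid]
    return rsid


def match_rsids(chip_ids, vcf_ids, merge_map):
    """Pair each chip rsID with the VCF rsID that resolves to the same current ID.

    Returns (matched, unmatched): matched maps chip ID -> ID as written in the VCF,
    unmatched lists chip IDs with no counterpart (withdrawn, or simply absent).
    """
    # Index the VCF IDs by their current form so both sides are compared alike.
    vcf_by_current = {resolve_rsid(v, merge_map): v for v in vcf_ids}
    matched, unmatched = {}, []
    for chip_id in chip_ids:
        current = resolve_rsid(chip_id, merge_map)
        if current in vcf_by_current:
            matched[chip_id] = vcf_by_current[current]
        else:
            unmatched.append(chip_id)
    return matched, unmatched


# Made-up IDs: rs111 was merged into rs222, which was later merged into rs333.
example_merges = {"rs111": "rs222", "rs222": "rs333"}
example_matched, example_unmatched = match_rsids(
    chip_ids=["rs111", "rs444", "rs555"],
    vcf_ids=["rs333", "rs444"],
    merge_map=example_merges,
)
```

In this toy case rs111 pairs with rs333 through the merge chain, rs444 pairs with itself, and rs555 is left unmatched. Anything that ends up unmatched would need a manual look, since it may have been withdrawn from dbSNP rather than merged.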


Do you know of an easy way to update rsIDs that have been retired?

written 3.0 years ago by eric.kern13190