Hi everybody,
I found many threads concerning multiple rs identifiers for single SNP, but my problem is quite the opposite.
Using BioMart I try to do whole genome analysis for deleterious SNPs in different organisms. To my surprise I constantly get many locations and many different genes for same rs identifier. Could this be because of duplications, orthologues, paralogues...? Here is an example for "rs108080416" in Mus musculus on Y chromosome:
Variation.Name Chromosome.position.start..bp. Chromosome.position.end..bp. Strand Ensembl.Gene.ID
rs108080416 70464491 70464491 -1 ENSMUSG00000094616
rs108080416 71714552 71714552 -1 ENSMUSG00000100535
rs108080416 72224547 72224547 1 ENSMUSG00000096902
rs108080416 73691822 73691822 -1 ENSMUSG00000099782
rs108080416 74493272 74493272 -1 ENSMUSG00000101528
rs108080416 75218384 75218384 -1 ENSMUSG00000101146
rs108080416 75737883 75737883 -1 ENSMUSG00000100902
rs108080416 77715389 77715389 -1 ENSMUSG00000094647
rs108080416 65400236 65400236 -1 ENSMUSG00000093993
rs108080416 68272307 68272307 1 ENSMUSG00000096016
rs108080416 68810257 68810257 1 ENSMUSG00000100032
rs108080416 64495766 64495766 -1 ENSMUSG00000101766
rs108080416 57209714 57209714 -1 ENSMUSG00000101155
rs108080416 57728261 57728261 -1 ENSMUSG00000101399
rs108080416 61669220 61669220 -1 ENSMUSG00000100726
rs108080416 62213469 62213469 -1 ENSMUSG00000094821
rs108080416 63250982 63250982 -1 ENSMUSG00000100856
rs108080416 52802573 52802573 -1 ENSMUSG00000095606
rs108080416 55236225 55236225 -1 ENSMUSG00000094570
rs108080416 55754273 55754273 -1 ENSMUSG00000095135
rs108080416 22838749 22838749 1 ENSMUSG00000103919
rs108080416 26754269 26754269 1 ENSMUSG00000100055
rs108080416 27499390 27499390 1 ENSMUSG00000099541
rs108080416 49593757 49593757 -1 ENSMUSG00000095979
rs108080416 8280450 8280450 -1 ENSMUSG00000101928
Thank you in advance for all your help.
Best regards, Aco
Thank you Pierre, I get it now =)