Question

How get Genetic Positions of SNPs?

0

Entering edit mode

2.7 years ago

Nemo • 0

I have some datasets related to different diseases like AML, etc. The data which I have include markers' genotypes (AA, AB, BB) and some other information like chromosome number, physical position, strand,etc. For some reasons I need to have the genetic positions of these snps as well. As I have searched so far, I could not find any exact solution for this (some suggested using 'MareyMap', but it also needs both physical position and genetic positions as input). Do you have any recommendation?

snps genotype distance genetic position • 1.8k views

ADD COMMENT • link updated 2.7 years ago by Sam ★ 4.7k • written 2.7 years ago by Nemo • 0

0

Entering edit mode

What does "genetic position" mean? Can you give us an example?

ADD REPLY • link 2.7 years ago by Ram 43k

0

Entering edit mode

for using the 'ldetect' package (for identifying the independent linkage disequilibrium blocks), the format of input should be : snpID, physical position, (cumulative) genetic position. (I have almost all the data except the third one. )

rs142251181 729358 0.0
rs116720794 729632 0.0
rs4951859 729679 3.20207111097e-05
rs148932570 729739 7.28982146541e-05

you can find paper for ldetect here

ADD REPLY • link 2.7 years ago by Nemo • 0

0

Entering edit mode

I don't want to read a paper to understand a term. Can you explain it here?

EDIT: I looked at the tool's manual/README, and it seems like a multi-step process. Where in this process is the "genetic position" used as an input without the data being generated by a previous step?

ADD REPLY • link 2.7 years ago by Ram 43k

0

Entering edit mode

exactly at the first step, for the first command you need to have a file as input with such information : snpID, physical position, (cumulative) genetic position. You can see the format of such file in the parameters of the first command with name 'example_data/chr2.interpolated_genetic_map.gz'.

ADD REPLY • link 2.7 years ago by Nemo • 0

0

Entering edit mode

I am guessing that the genetic position might have been the centiMorgan. One that I found is here: http://bochet.gcc.biostat.washington.edu/beagle/genetic_maps/ though you might want to make sure the build is correct.

ADD REPLY • link 2.7 years ago by Sam ★ 4.7k

0

Entering edit mode

I am not sure if the genetic positions of all snps in all populations are the same. Can you verify this? (to be more clear, I am skeptical if my samples in my AML dataset, have the same genetic positions as other population which are provided in your link.)

ADD REPLY • link 2.7 years ago by Nemo • 0

1

Entering edit mode

Each population has their own genetic map. As you've not stated your population, I just sent you the first one I found. Here is another one contain gentic map for different populations: https://github.com/joepickrell/1000-genomes-genetic-maps

Usually, it is ok to use the 1000G population map. If you want to calculate your own genetic map, you might need to use packages like LDhat or something like that.

ADD REPLY • link 2.7 years ago by Sam ★ 4.7k

0

Entering edit mode

Thanks Sam for your nice explanation, I would wonder if you let me know, since I do not have any family history of my population to infer the genetic position(distance), can I simply use the 1000G for any population? (Chinese, Asian,...)

ADD REPLY • link 2.7 years ago by Nemo • 0

1

Entering edit mode

Yes, as long as you pick one that is most closely resemble your population, then that should be fine.

ADD REPLY • link 2.7 years ago by Sam ★ 4.7k