Genotype imputation - do SNP IDs of input files and reference panels matter?
2.8 years ago
Volka ▴ 180

Hi all,

I am currently running genotype imputation with Minimac4, using a mixture of reference panels and input files. In some of my input VCFs, I reannotated the SNP IDs so that they follow the format CHR:POS:REF:ALT. In my other input files, the SNP IDs are a mix of rsIDs and CHR:POS. The files are otherwise identical.

I have already run imputation on, for example, the files from Run A, where the SNP IDs are in the format CHR:POS:REF:ALT, and Run B, where the SNP IDs are a mix. My question is: do I have to go back and standardize the SNP ID formats before imputation, or would the result be the same regardless? Does Minimac4 carry out imputation based on SNP position only?

genotype SNP reference VCF imputation • 741 views

My guess is that the IDs aren't important and the imputation is based upon the POS column (of course scaled by genetic distance).
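One way to test this guess is to compare the imputed output of the two runs while ignoring the ID column entirely. The sketch below is only an illustration, not anything taken from Minimac4: it assumes uncompressed, tab-delimited VCF text with one ALT allele per record, it keys each record on (CHROM, POS, REF, ALT), and it compares only the per-sample columns. The function names and the in-memory example records are made up for the demonstration.

```python
def records_by_coordinate(lines):
    """Map (CHROM, POS, REF, ALT) to the per-sample columns, ignoring ID."""
    records = {}
    for line in lines:
        if line.startswith("#"):
            continue  # skip meta-information and header lines
        fields = line.rstrip("\n").split("\t")
        chrom, pos, _old_id, ref, alt = fields[:5]
        # Columns 10 onward hold the sample data (after QUAL, FILTER, INFO, FORMAT).
        records[(chrom, int(pos), ref, alt)] = fields[9:]
    return records


def differing_sites(lines_a, lines_b):
    """Return the sorted coordinates whose sample columns differ or are missing in one input."""
    a = records_by_coordinate(lines_a)
    b = records_by_coordinate(lines_b)
    return sorted(key for key in a.keys() | b.keys() if a.get(key) != b.get(key))


# Hypothetical records: same site and genotypes, different ID styles.
header = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\n"
run_a = [header, "1\t100\t1:100:A:G\tA\tG\t.\tPASS\t.\tGT:DS\t0|1:1.000\n"]
run_b = [header, "1\t100\trs1\tA\tG\t.\tPASS\t.\tGT:DS\t0|1:1.000\n"]

print(differing_sites(run_a, run_b))
```

If the list comes back empty for the real Run A and Run B outputs, the IDs made no difference to the imputed genotypes. For real files you would pass open file handles (for example via `gzip.open(path, "rt")` for `.vcf.gz`) instead of the in-memory lists.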

2.8 years ago
LauferVA 4.2k

It very likely doesn't need that information, but the only ways to be sure would be to look at the code or to try it. It generates the imputation estimates based on linkage and, yes, location, although the exact base position is not really as important as having the variants in the correct order. Either way, it should only take a very short script to replace the old names with the new ones if you have both files handy.
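As a rough idea of what such a script could look like, here is a minimal sketch that rewrites the ID column to CHR:POS:REF:ALT directly from the other columns, so it does not even need the second file. It assumes uncompressed, tab-delimited VCF text and biallelic records (multi-allelic sites would need splitting first, for example with `bcftools norm -m -`). The function name and the example record are hypothetical.

```python
def standardize_vcf_ids(lines):
    """Yield VCF lines with the ID column replaced by CHR:POS:REF:ALT."""
    for line in lines:
        if line.startswith("#"):
            yield line  # pass meta-information and header lines through unchanged
            continue
        fields = line.rstrip("\n").split("\t")
        chrom, pos, _old_id, ref, alt = fields[:5]
        fields[2] = f"{chrom}:{pos}:{ref}:{alt}"
        yield "\t".join(fields) + "\n"


# Hypothetical input: one header line and one record carrying an rsID.
example = [
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n",
    "1\t100\trs1\tA\tG\t.\tPASS\t.\n",
]
renamed = list(standardize_vcf_ids(example))
print(renamed[1], end="")
```

For whole files, the same generator can be fed an open input handle and its output written line by line to a new file. `bcftools annotate --set-id` is another route to the same result if bcftools is already installed.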

Hope that helps.
