Whats the point of genotype pruning before imputation?
1
1
Entering edit mode
4.8 years ago

Hey guys,

The point of imputation is to predict un-typed SNPs using D' or r2 values. However, it is standard to remove all SNPs in LD before imputation e.g. with plink command --indep-pairwise. May i ask what's the purpose of removing actually typed SNP's and then trying to "predict them back"? I'm for sure missing something, but still, curiosity is the point of my job heh

edit: I'm following Joni Coleman tutorial (he uses --indep-pairwise to prune and later remove these snp)

imputation plink --indep-pariwise r2 • 1.6k views
ADD COMMENT
1
Entering edit mode

Could you provide a link to that protocl:

standard to remove all SNPs in LD before imputation

ADD REPLY
0
Entering edit mode

Hey Zack, thanks for response! I’m following John Colemans tutorial https://github.com/JoniColeman/gwas_scripts

ADD REPLY
1
Entering edit mode
4.8 years ago
zx8754 11k

They are not pruning before imputation, pruning is a step before doing other QC steps, to calculate IBD and remove related individuals, or calculate PCs to remove outlier individuals based on ethnicity. As the final step of QC, there will be plink command something like:

plink --bfile original \
--remove individualsRelated individualsPCoutlier... individualsCallRate ... \
--exclude SNPsHWE... SNPsOtherQC, ... \
--recode oxford \
--out originalQC
ADD COMMENT

Login before adding your answer.

Traffic: 2885 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6