Question: Selecting SNPs to impute in Plink
0
gravatar for hessjl
5.9 years ago by
hessjl70
United States
hessjl70 wrote:

Hi all,

I want to construct a workflow that will allow me to impute small batches of SNPs that were untyped in my GWAS data (in .bed .bim .fam fomat).  

Is it possible to meaningfully subset my reference panel (hapmap release 23) so that I only impute the ~8000 missing SNPs? Would it be erroneous to start by subsetting the reference by tag SNPs to those 8000 SNPs, then merging the reference and GWAS data, then imputing genotypes?

Thanks for the help in advance! 

 

plink impute • 1.9k views
ADD COMMENTlink modified 5.9 years ago by chrchang5237.3k • written 5.9 years ago by hessjl70
1
gravatar for chrchang523
5.9 years ago by
chrchang5237.3k
United States
chrchang5237.3k wrote:

Is there a reason you still want to use Plink here?  Its imputation algorithm was already considered obsolete 5 years ago: see http://www.ncbi.nlm.nih.gov/pubmed/19089453 .

I suggest choosing a more effective imputation tool and applying it to your data.  BEAGLE, IMPUTE2, and MaCH continue to be reasonable choices, and it should be straightforward to convert your data to their formats.  (1000 Genomes phase 3 reference panels are also now available for all three programs; they should yield more accurate results than HapMap release 23.)

ADD COMMENTlink written 5.9 years ago by chrchang5237.3k

Thanks for this insight! As subtext, are you indicating that whole GWAS imputation is the way to go; or can I still pre-filter my reference set to expedite the imputation turnaround? 

ADD REPLYlink written 5.9 years ago by hessjl70

Impute2 should support this sort of variant filtering: see http://mathgen.stats.ox.ac.uk/impute/impute_v2.html#ex4 .

ADD REPLYlink written 5.9 years ago by chrchang5237.3k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2048 users visited in the last hour