Michigan imputation server input preparation
Entering edit mode
6 months ago
rachelpachel ▴ 50

I'm running the imputation on some WGS VCF files, and I've never done it before, so I started with Chr 22 (GRCh37). I'm just worried because I started with 594949 SNPs in the file, and after the prep (using the Will Rayner perl script recommended here https://imputationserver.readthedocs.io/en/latest/prepare-your-data/ ) there's only 243108 for input into the imputation server, which is a high attrition rate! I tried to do some poking, though I don't know perl very well... best I canfigure is that most of the ones removed do not have an RSid.

My question: Is it OK to have such a reduced set of SNPs before imputation? Should I try to put on an rsid before running this? Thanks!

imputation • 364 views
Entering edit mode
5 weeks ago
vtran • 0

You can look at the QC_report after imputation to see which SNPs got excluded and why. Usually, a lot of SNPs are excluded because they are either monomorphic or have low call rates.


Login before adding your answer.

Traffic: 2360 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6