Hello everyone,
for fun I checked whether there are any differences between PLINK and PASW/SPSS when doing a logistic regression (and standard parameters) on a binary trait. Using the same dataset, I get quite different significant SNPs - they sometimes overlap slightly, but sometimes the most significant SNPs in PLINK don't even appear in the model that PASW builds.
For testing I don't use any SNP-specific cleaning-steps etc., just wanted to check how both programs compare.
Does anybody have any explanation?
Thanks!
So in doing a GWAS, I should "trust" Plink more than SPSS?
You could trust the plink logistic reg. approach if you believe in additive effects of the rare allele (more alleles -> more effect in almost linear way). If you believe there could be other possible explanation for your outcome (e.g. recessive, dominant, co-dominant...) you could treat SNPs as factors (each allele has its own effect on the outcome).
p.s. if you do not worry about covariates like age sex etc... but only about genetic component consider the --model option of plink