Question: Why quantitative design are preferred GWAS approach
gravatar for
13 months ago by
ste.lu50 wrote:


I was writing something about GWAS, however is not really my field and so lot of reading. I encountered this statement in Bush and Moore, 2012 (Chapter 11: Genome-Wide Association Studies, 2012):

There are two primary classes of phenotypes: categorical (often binary case/control) or quantitative. From the statistical perspective, quantitative traits are preferred because they improve power to detect a genetic effect, and often have a more interpretable outcome. For some disease traits of interest, quantitative disease risk factors have already been identified.

Can anyone help me to understand why quantitative trait has more power (even with some formulas it will be great)? are they referring to QTL somehow?

Thank you very much

statistical analysis gwas qtl • 710 views
ADD COMMENTlink modified 12 months ago by Kevin Blighe49k • written 13 months ago by ste.lu50

I think one reason could be quantitative traits follow certain distribution like normal distribution so they could be statistically tested against the null hypothesis for example t student test while qualitative traits usually have to be tested by non parametric tests as these data don't follow certain distributions.

ADD REPLYlink written 13 months ago by Za120

Do you think parametric or non parametric test makes any difference with the number of GWAS? I am not saying is not the case, but, but I want just to understand your point.

Thanks for you answer

ADD REPLYlink written 13 months ago by ste.lu50

Actually I also don't know deeply but I only know quantitative data can be model and tested with more flexibility , I am sorry :(

ADD REPLYlink modified 13 months ago • written 13 months ago by Za120
gravatar for Kevin Blighe
12 months ago by
Kevin Blighe49k
Kevin Blighe49k wrote:

Quantitative (continuous) traits are preferred because they contain more information. However, we are strictly referring to quantitative traits that already follow a data distribution that can be modeled in whatever it is your proposed statistical test. Usually, this would mean a Gaussian / normal distribution. If you have a very weird variable that has a skewed distribution that cannot be modeled, then changing it to qualitative (categorical) would be better.

Think about it: we have a beautiful variable of n=1000000 and it 'perfectly' follows our expected distribution (in R):

million <- rnorm(1000000)


Now lets dichotomise it:

million[million<=2] <- 0
million[million>2] <- 1


They look completely different and you can see that we have lost so much information. Whilst we can treat this new data as categorical, you can clearly appreciate at the same time that we have thrown out so much information.


It is this lost information that increases error (type II) and, therefore, reduces statistical power. Remember that, generally speaking, statistical power is the level of our ability to identify an effect when an effect is actually present in our cohort. You can therefore appreciate that, by throwing out useful information, we are reducing our power.


PS - wrote a bit more here: A: Log-tranformation and GWAS

ADD COMMENTlink modified 9 months ago • written 12 months ago by Kevin Blighe49k

Great answer, Thank!

ADD REPLYlink written 12 months ago by ste.lu50
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1161 users visited in the last hour