Question

fold change vs standardized effect size

9

9.0 years ago

mvlombardo ▴ 90

Hi all,

Just a quick question (as a newcomer to bioinformatics) regarding effect size in differential expression analysis. Why does the field opt for using fold change as a metric of effect size? Fold change doesn't take into account variability, whereas standardized effect size measures like Cohen's d do. So why doesn't the field report effect sizes that take into account variability?

To illustrate with an example, say gene X has a mean of 7.20 in condition A and 7.60 in condition B. The fold change for condition B compared to condition A is 7.60/7.20 ≈ 1.06. Say the standard deviation estimate in condition A is 0.09, while in condition B it's 0.10. Computing Cohen's d on this gives an effect size of around 4.2, which is a gigantic effect. Fold change and Cohen's d differ dramatically, so why not report effect size estimates that take into account variability rather than fold change?

Thanks,
Mike
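
The arithmetic in the question's example can be checked with a short sketch. It assumes equal group sizes, so the pooled standard deviation is the root mean square of the two SDs; the question does not state which pooling was used, and the function name `cohens_d` is only illustrative.

```python
import math

def cohens_d(mean_a, mean_b, sd_a, sd_b):
    """Cohen's d using a pooled SD, assuming equal group sizes (an assumption)."""
    pooled_sd = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    return (mean_b - mean_a) / pooled_sd

# Summary statistics for gene X, taken from the question.
mean_a, mean_b = 7.20, 7.60
sd_a, sd_b = 0.09, 0.10

fold_change = mean_b / mean_a              # plain ratio of means, ignores variability
d = cohens_d(mean_a, mean_b, sd_a, sd_b)   # mean difference in units of pooled SD

print(f"fold change = {fold_change:.3f}")
print(f"Cohen's d   = {d:.2f}")
```

The ratio stays close to 1 while d is above 4, because d scales the same 0.40 difference by a pooled SD of roughly 0.095.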

RNA-Seq • 11k views

updated 14 months ago by Ram 43k • written 9.0 years ago by mvlombardo ▴ 90

6

9.0 years ago

Michael Love ★ 2.6k

DESeq2's posterior log fold changes are "reliable" effect sizes, that is, directly comparable across experiments, because the fold changes from genes with less information (low counts, high variability) are moderated toward zero using Bayes' theorem. We lay out the argument in our paper here: http://genomebiology.com/2014/15/12/550 . We also provide Wald statistics in the results table, but this is not exactly what you are asking for: the Wald statistic divides by the SE of the estimate, not the SD of the data.

[Edit]: you could use the expected variance formula for log counts to add your standardized effect size: V = 1/mu + dispersion, where mu is the mean of normalized counts for the gene. So divide the log fold change by sqrt(1/mu + dispersion).
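
As a minimal sketch of the suggestion above, the following divides a log fold change by sqrt(1/mu + dispersion). It is plain Python, not DESeq2 code. The function name `standardized_lfc` and the input values are hypothetical, and the formula is applied exactly as written in the post. Which DESeq2 outputs should supply mu and the dispersion is not confirmed in this thread.

```python
import math

def standardized_lfc(lfc, base_mean, dispersion):
    """Divide a log fold change by sqrt(1/mu + dispersion), as stated in the post."""
    if base_mean <= 0:
        raise ValueError("base_mean must be positive")
    expected_sd = math.sqrt(1.0 / base_mean + dispersion)  # sqrt of V = 1/mu + dispersion
    return lfc / expected_sd

# Hypothetical per-gene values, not taken from any real dataset.
high_count = standardized_lfc(lfc=1.0, base_mean=100.0, dispersion=0.04)
low_count = standardized_lfc(lfc=1.0, base_mean=4.0, dispersion=0.04)

print(f"high-count gene: {high_count:.2f}")
print(f"low-count gene:  {low_count:.2f}")
```

With the same log fold change and dispersion, the low-count gene gets a smaller standardized value because the 1/mu term inflates its expected variance.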

updated 14 months ago by Ram 43k • written 9.0 years ago by Michael Love ★ 2.6k

0

Hi @Michael! I see this is an old post, but do you suggest directly using the LFC values after applying lfcShrink() with the apeglm method, or do you still suggest dividing these LFCs by sqrt(1/mu + dispersion)? If it's the latter, is the mu for each gene given by the baseMean column, and is the dispersion value extracted using dispersionFunction(), or is it the lfcSE column? Many thanks!

4.6 years ago by manikg14 • 0

0

Hi @manikg14, did you resolve this issue? Having the same problem right now! Thanks

2.6 years ago by heather.jackson17 • 0

0

Heather and manikg, just wanted to check whether you got answers on this. Michael? Reviewer #2 is asking for standardized effect sizes on my DESeq2 analysis, so it'd be great to make sure I'm using the correct LFC, baseMean and dispersion values. Thanks!

23 months ago by gsudre • 0

0

If you want moderated effect sizes, use LFC.

If you want something like a t-statistic I would just use the Wald test statistic from results().

22 months ago by Michael Love ★ 2.6k

Ram · Accepted Answer · 2015-05-06

Historical inertia mostly.

Having said that, there's a slowly growing movement within bio-related fields to include confidence intervals in all estimate reports. While those obviously aren't standardized effect sizes, they serve a similar purpose (e.g., you can use them for power calculations, which more people should actually do). Actually, I find confidence intervals more useful than standardized effect sizes, but perhaps that's just me.

I should add that standardized effect sizes are not a panacea. My original background is ion channel biophysics, where we can measure very, very small changes extremely robustly. I could trivially get Cohen's d values vastly larger than the one you showed, but the results would still be biologically meaningless. That's the problem with relying too much on any single number (this goes doubly for p-values): it's easy to point at one and yell, "This is important!", when the finding actually isn't.