Question: Setting a threshold for FDR correction
0
gravatar for kelly.wang135
7 months ago by
Korea, Republic Of
kelly.wang13550 wrote:

Hello, I am studying correction for multiple testing and have some trouble understanding the FDR.

  • significance level=0.05
  • number of tests=100

For Bonferroni correction, p-value should be multiplied by the number of tests. So, the raw p-value(0.001) * number of tests(100) = Bonferroni corrected p-value(0.1). So we can say that this p-value is not significant (when alpha=0.05).

Or, we can set the cutoff for Bonferroni correction as 0.05/100 = 0.0005. So the p-value of 0.001 is not significant, as this 0.001 is greater than 0.0005.

For FDR correction, p-value is calculated as: p-value * rank/number of tests. If this p-value ranks fifth among 100 tests, raw p-value(0.001) * 5/100 = FDR corrected p-value(0.00005). So we can say that this p-value is still significant.

My question is, then how can I set the cutoff for FDR correction? (as 0.05/100 in Bonferroni correction above)

ADD COMMENTlink modified 7 months ago by russhh4.9k • written 7 months ago by kelly.wang13550
0
gravatar for russhh
7 months ago by
russhh4.9k
UK, U. Glasgow
russhh4.9k wrote:

Your description of FDR correction isn't right. If you've tested M (independent [1]) hypotheses, and your gene has raw-p-value of P and rank I, then it's FDR adjusted p-value is PI/R ([2]). The way you described it, the FDR-adjusted p-value would always be lower than the raw p-value (which wouldn't make sense).

[1] - let's face it, genes don't behave independently, so it's a bit strange for us to treat hypotheses over gene expression to be independent, but that's a conversation for another day

[2] - there is a slightly different way of computing FDR_adjusted p-value: "The adjusted P value for a test is either the raw P value times m/i or the adjusted P value for the next higher raw P value, whichever is smaller"; for more details see http://www.biostathandbook.com/multiplecomparisons.html

ADD COMMENTlink modified 7 months ago • written 7 months ago by russhh4.9k

Thanks for your kind answer and correction. What I wondered was how to set a threshold for the FDR correction. As I understand, the cutoff should be determined based on the all p-values, not by just the number of tests as in the Bonferroni correction. The largest FDR-corrected p-value <=0.05 is 0.05 (Protein, ranks 5) in the example data from the link you wrote, so the cutoff value for FDR correction should be 0.042 (raw p-value of Protein). Am I getting it right? Well, this is my guess, but not confident with my answer. Thanks for your help, russhh!

ADD REPLYlink written 7 months ago by kelly.wang13550
1

I would recommend the excellent videos by Josh Starmer at UNC called "StatQuest". I find them to be very helpful and concise. Maybe this would be of use?

StatQuest: FDR and the Benjamini-Hochberg Method clearly explained

ADD REPLYlink written 7 months ago by RNAseqer 110
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1163 users visited in the last hour