Should I do P-value adjustment in this case?
1
0
Entering edit mode
3.4 years ago

Hi there!!! I have done functional analysis of WGS metagenome data HUMAnN. I will be testing differentially abundant functional pathways among the control and test groups. I have around 400 features (pathways) to test. Considering, this, should I find out differentially abundant pathways depending upon adjusted P-value (q-value)? Same question I also have for MetaPhlAn taxonomical analysis, where I have to check for around 300 taxa.

Thanks

statistics fdr • 2.0k views
ADD COMMENT
1
Entering edit mode

You can report your results as:

  1. Raw p-values
  2. FDR correction at 0.05
  3. FDR correction at 0.10
ADD REPLY
1
Entering edit mode
3.4 years ago

Short answer is yes, it is always better to use pvalue correction when you test for multiple things. I don't know where is the limit exactly, but IMHO, 300 or 400 tests fully justify the use of pvalue correction.

ADD COMMENT
0
Entering edit mode

Thanks, CarloYague. But, when I did P-value adjustment I got only 2-3 significant results, which is around 30-35 when not adjusted. Can you please guess what may be the reason this happens?

ADD REPLY
2
Entering edit mode

When you perform multiple comparisons, your type one error rate (false positives) increases by 1-(1-alpha)^comparisons. For 300 comparisons, your false positive rate is 99.99%, so you expect a fairly large number of significant values even if the null hypothesis is true. As for why you have a low number of values after correction, there could be a few explanations.

If you performed Bonferroni correction, the correction is considered too conservative and you may be increasing your type II error rate (false negative). A less conservative method is often used, such as FDR.

If you got these results with a less conservative p-value correction, such as FDR, then that could signify a few things. Either the null hypothesis you are testing against is indeed true for most samples, or the effect size is too small to see for most samples given your sample size.

ADD REPLY
0
Entering edit mode

Thanks, Rpolicastro- I am working on already submitted datasets and I am expecting a lot of significant features as a similar type of test has been done in other populations. Definitely, there should be enough significant features. I have done a LefSe analysis with my dataset. LefSe first does the Kruskal-Wallis test followed by effect size estimation (LDA score > 2.0). I get 30-35 significant features (P <0.05) after LefSe whereas the number reduced drastically while P-value adjusted. At this level, it is confusing whether it is necessary to adjust P-value as LefSe is doing two-step filtering (Kruskal-Wallis, Effect Size), and also I am getting almost no significant result after adjustment. What is your take call in this condition?

Thanks

ADD REPLY
0
Entering edit mode

Is it only outputting the significant features? You need the p-value for all comparisons to compute FDR.

ADD REPLY
0
Entering edit mode

No, the output contains all the p-values. I have taken the table and calculated the adjusted p-value with p.adjust() in R.

ADD REPLY
1
Entering edit mode

If you are doing 400 multiple comparisons with a p-value threshold of 0.05, you would except roughly 20 significant tests even if the null hypothesis was true for all tests. Since you are only getting 30-35, It's likely that most of the significant p-values are just consequences of multiple comparisons and not a significant deviation from the null hypothesis.

ADD REPLY
0
Entering edit mode

What kind of adjustment method did you used?

ADD REPLY
0
Entering edit mode

Hi andres, I have done FDR correction (Benjamini-Hochberg). Thanks

ADD REPLY
0
Entering edit mode

If your pvalue threshold is 0.05, then you should expect about 20 false positives from 400 test. As stated by rpolicastro, Bonferroni correction is a bit over-conservative, so it makes sense that only a few of your 35 significant hit are still significant after correction.

ADD REPLY

Login before adding your answer.

Traffic: 2574 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6