Question: Adjusted P-value in GO analysis
gravatar for DNA
14 days ago by
DNA0 wrote:


I am doing GO analysis using software such as EnrichR, and all results come out with good P values but the adjusted P values are around 0.5 or even around 1. Is it a glitch or it is expected to be like this, and are the results then statistically significant?


go gene ontology p value • 183 views
ADD COMMENTlink modified 13 days ago by EagleEye4.9k • written 14 days ago by DNA0
gravatar for Philipp Bayer
14 days ago by
Philipp Bayer5.1k
Philipp Bayer5.1k wrote:

Since a GO enrichment analysis runs many, many, many tests (either one test for each gene or one test for each GO-term, can't remember) just by chance alone you would expect to see terms that seemingly have a significant p-value

Let's say you set your p-value cutoff of 0.05. You run one test and your p-value is below 0.05, nothing of significance. You keep on repeating tests, by test 20 you have a significant p-value, hooray! However, with a p-value cutoff of 0.05, you expect to see one significant test purely by chance after 20 tests (20*0.05 = 1). See slide 4 here for another example.

That's why software that runs many many tests runs multiple test correction to get around this problem, it adjusts the p-values based on the number of tests you ran (~the size of your input dataset).

In your case I would ignore the column of the unadjusted p-value and just look at the adjusted p-values, if they're not <0.05 there then they are not significant.

ADD COMMENTlink written 14 days ago by Philipp Bayer5.1k
gravatar for theobroma22
14 days ago by
theobroma221.1k wrote:

You can use the P-value...although the change is not really expected as it’s dependent on the mathematics underlying the adj. p-value calculation. Essentially, you should’ve, well based on Bayes, defined your cut-off value prior to doing the calculation. Anyway, if you should be required to use the adj. p-value, then typically the 0.05 is the standard cut-off. Using this metric assures there are less false positives compare to the P-value.

ADD COMMENTlink written 14 days ago by theobroma221.1k
gravatar for EagleEye
13 days ago by
EagleEye4.9k wrote:

Though what Philipp Bayer saying is true, the p-value metric is still used to filter the enriched process. In addition to that I would also recommend you to consider filtering by percentage of genes covered from particular pathway/biologicalProcess by your input list.

If you cannot find the information (% genes covered in pathway) from EnrichR, try GeneSCF.

A: GeneSCF and P-value Benjamini and Hochberg (FDR)

ADD COMMENTlink modified 13 days ago • written 13 days ago by EagleEye4.9k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 817 users visited in the last hour