Question

Reduce redundancy in a GO enrichment analysis

3

9.6 years ago

dago ★ 2.8k

I performed a GO enrichment analysis using topGO in BioC.

The analysis worked, and I now have the following enriched GO terms (GO_ID) with their respective p-values (Fisher):

> head(cyt.MF.merged.combined[, c("GO_ID", "Fisher")])
       GO_ID  Fisher
1 GO:0000156 0.01314
2 GO:0004322 0.01705
3 GO:0004519 0.04516
4 GO:0004601 0.01917
5 GO:0005315 0.01705
6 GO:0008509 0.00769

However, many of the terms (e.g. GO:0016722 and GO:0016724) are in a parent-child relationship with one another.

Since I do not need that level of detail, I was wondering whether there is a tool that merges such related terms into their shared, less specific parent term. Of course, I would then need to recalculate a p-value for the resulting parent node.

Does anyone have experience with this?

Thanks in advance.

GO gene-enrichment Bioconductor • 5.2k views

updated 2.3 years ago by Ram 43k • written 9.6 years ago by dago ★ 2.8k

Ram · Answer 1 · 2014-09-26

1

9.6 years ago

russhh 5.7k

You could use GOstats: if conditional=TRUE is set in the GOHyperGParams object passed to hyperGTest, the test accounts for some of the dependencies between GO terms.
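A minimal sketch of what this could look like. In GOstats the conditional flag is a slot of the GOHyperGParams object that hyperGTest receives. The wrapper name `conditional_go_test` and its arguments `selected_ids`, `universe_ids` and `annotation_pkg` are hypothetical placeholders, not something from the original post; the gene IDs must match the ID type used by whichever annotation package you supply.

```r
# Sketch only: wraps a conditional GO over-representation test from GOstats.
# `selected_ids`, `universe_ids` and `annotation_pkg` are placeholders that
# the caller must supply; they are assumptions, not part of the original post.
conditional_go_test <- function(selected_ids, universe_ids, annotation_pkg,
                                ontology = "MF", p_cutoff = 0.05) {
  if (!requireNamespace("GOstats", quietly = TRUE)) {
    stop("The Bioconductor package 'GOstats' is required.")
  }
  # Attach GOstats so that the parameter class and hyperGTest are visible.
  suppressPackageStartupMessages(library(GOstats))

  params <- new(
    "GOHyperGParams",
    geneIds         = selected_ids,    # genes of interest
    universeGeneIds = universe_ids,    # background gene set
    annotation      = annotation_pkg,  # name of an annotation package
    ontology        = ontology,        # "MF", "BP" or "CC"
    pvalueCutoff    = p_cutoff,
    conditional     = TRUE,            # condition on significant child terms
    testDirection   = "over"
  )
  hyperGTest(params)
}
```

With GOstats attached, calling summary() on the returned object should give the table of significant terms.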

HTH

ADD COMMENT • link updated 2.3 years ago by Ram 43k • written 9.6 years ago by russhh 5.7k

0

Thanks for your answer. I know GOstats, and you are right that it is a good tool for this kind of analysis. I was just wondering whether another tool is available that can reduce redundancy after the enrichment has already been performed with topGO.
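One simple way to do this after the fact, sketched here under assumptions, is to drop every enriched term that has an ancestor which is itself enriched, so only the most general significant terms remain and no p-value has to be recalculated. This is a plain parent/child filter, not what any particular tool does. The GO IDs and the `ancestors` list below are made-up toy data; for real terms such a list could presumably be built from the GO.db package (e.g. `as.list(GOMFANCESTOR)`), which is an assumption about your setup. The function name `drop_redundant_children` is hypothetical.

```r
# Toy enrichment table with the same columns as in the question.
# The IDs are invented for illustration and are not real GO terms.
enriched <- data.frame(
  GO_ID  = c("GO:A", "GO:B", "GO:C", "GO:D"),
  Fisher = c(0.010, 0.020, 0.030, 0.040),
  stringsAsFactors = FALSE
)

# Hypothetical ancestor map: each term points to ALL of its ancestors.
ancestors <- list(
  "GO:B" = c("GO:A"),
  "GO:C" = c("GO:B", "GO:A"),
  "GO:D" = c("GO:Z")          # ancestor that is not in the enriched set
)

# Keep a term only if none of its ancestors is also in the table.
drop_redundant_children <- function(tab, ancestors) {
  has_enriched_ancestor <- vapply(
    tab$GO_ID,
    function(id) any(ancestors[[id]] %in% tab$GO_ID),
    logical(1)
  )
  tab[!has_enriched_ancestor, , drop = FALSE]
}

reduced <- drop_redundant_children(enriched, ancestors)
print(reduced)
```

In the toy data, GO:B and GO:C are removed because their ancestor GO:A is already in the table, while GO:D stays because its only ancestor was not enriched.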

updated 2.3 years ago by Ram 43k • written 9.6 years ago by dago ★ 2.8k

Ram · Answer 2 · 2014-09-30

1

9.6 years ago

dariober 14k

Have a look at Revigo and/or EnrichmentMap (although Revigo seems to be down just now...).
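A small sketch of how the topGO results could be prepared for such a tool, assuming it accepts one GO ID and its p-value per line separated by whitespace (check the tool's input instructions). The three rows are copied from the table in the question; the file is written to a temporary path here, and `out_file` is just an illustrative name.

```r
# First three rows from the question's table, used as example input.
enriched <- data.frame(
  GO_ID  = c("GO:0000156", "GO:0004322", "GO:0004519"),
  Fisher = c(0.01314, 0.01705, 0.04516),
  stringsAsFactors = FALSE
)

# Write "GO_ID <tab> p-value" lines without header, quotes or row names.
out_file <- tempfile(fileext = ".txt")
write.table(
  enriched[, c("GO_ID", "Fisher")],
  file      = out_file,
  sep       = "\t",
  quote     = FALSE,
  row.names = FALSE,
  col.names = FALSE
)

# Read the file back to inspect what would be pasted or uploaded.
revigo_lines <- readLines(out_file)
print(revigo_lines)
```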

Dario

updated 2.3 years ago by Ram 43k • written 9.6 years ago by dariober 14k