Question: looking for an R package for GO term analysis
gravatar for Javad
9 days ago by
Javad0 wrote:


Can anyone suggest a simple R package to perform GO term analysis. Specially I am looking for a package that can produces publication quality figures and heatmaps regarding GO term results.

Thanks in advance

rna-seq next-gen • 155 views
ADD COMMENTlink modified 7 days ago by Kevin Blighe3.2k • written 9 days ago by Javad0
gravatar for Diwan
8 days ago by
United States
Diwan380 wrote:

Check bioc package "clusterprofiler".

For heatmaps of GO enrichment, check "revigo". They provide the output as R script which you can load in R and manipulate colors etc. to suit your need.

ADD COMMENTlink written 8 days ago by Diwan380
gravatar for Kevin Blighe
8 days ago by
Kevin Blighe3.2k
Republic of Ireland (√Čire)
Kevin Blighe3.2k wrote:

There are quite a few packages out there, and there was a thread on this already: GO enrichment analysis using R

See also my answer to this thread on GO term reliability: A: Go annotation reliability ?

One that is not mentioned in these threads is topGO

Another one with heatmaps is GOexpress

ADD COMMENTlink written 8 days ago by Kevin Blighe3.2k

Thank you very much for your response. Because I am new to this area, I would like to know a little bit about how GO term analysis works. There are few publications available but most of them are highly technical and hard to understand. Is there a good paper about it that explains the idea behind GO term analysis for a beginner?

ADD REPLYlink written 8 days ago by Javad0

Well, you basically have to be very careful with 'gene enrichment', as I would call it in the broadest form ('gene enrichment' includes 'GO analysis'). How it works is that each enrichment term has a number of genes associated with it.

For example:

DNA double strand break repair = TP53, ATM, BRCA1, BRCA2, etc.

If we then have data that shows that 3 of these genes are down regulated, then we can have high confidence that our DNA double strand break repair pathway is going to be adversely affected.

The number of genes assigned per term is key, and also the level of evidence behind each term:gene association. That's why I pasted the link to the other thread where I explain the evidence codes behind the GO terms: A: Go annotation reliability ?

This is just me explaining it at the fundamental level.

ADD REPLYlink written 7 days ago by Kevin Blighe3.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1273 users visited in the last hour