Question: looking for an R package for GO term analysis
0
gravatar for Javad
9 weeks ago by
Javad0
Javad0 wrote:

Hello,

Can anyone suggest a simple R package to perform GO term analysis. Specially I am looking for a package that can produces publication quality figures and heatmaps regarding GO term results.

Thanks in advance

rna-seq next-gen • 259 views
ADD COMMENTlink modified 9 weeks ago by Kevin Blighe9.3k • written 9 weeks ago by Javad0
2
gravatar for Diwan
9 weeks ago by
Diwan450
United States
Diwan450 wrote:

Check bioc package "clusterprofiler". https://guangchuangyu.github.io/2016/01/go-analysis-using-clusterprofiler/

For heatmaps of GO enrichment, check "revigo". They provide the output as R script which you can load in R and manipulate colors etc. to suit your need.

ADD COMMENTlink written 9 weeks ago by Diwan450
2
gravatar for Kevin Blighe
9 weeks ago by
Kevin Blighe9.3k
London
Kevin Blighe9.3k wrote:

There are quite a few packages out there, and there was a thread on this already: GO enrichment analysis using R

See also my answer to this thread on GO term reliability: A: Go annotation reliability ?

One that is not mentioned in these threads is topGO

Another one with heatmaps is GOexpress

ADD COMMENTlink written 9 weeks ago by Kevin Blighe9.3k

Thank you very much for your response. Because I am new to this area, I would like to know a little bit about how GO term analysis works. There are few publications available but most of them are highly technical and hard to understand. Is there a good paper about it that explains the idea behind GO term analysis for a beginner?

ADD REPLYlink written 9 weeks ago by Javad0

Well, you basically have to be very careful with 'gene enrichment', as I would call it in the broadest form ('gene enrichment' includes 'GO analysis'). How it works is that each enrichment term has a number of genes associated with it.

For example:

DNA double strand break repair = TP53, ATM, BRCA1, BRCA2, etc.

If we then have data that shows that 3 of these genes are down regulated, then we can have high confidence that our DNA double strand break repair pathway is going to be adversely affected.

The number of genes assigned per term is key, and also the level of evidence behind each term:gene association. That's why I pasted the link to the other thread where I explain the evidence codes behind the GO terms: A: Go annotation reliability ?

This is just me explaining it at the fundamental level.

ADD REPLYlink written 9 weeks ago by Kevin Blighe9.3k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 645 users visited in the last hour