Question: RNA-seq cluster analysis in R
0
gravatar for MarjoryMollusc
2.1 years ago by
Australia
MarjoryMollusc50 wrote:

So I have my time series RNA-seq data, which I conducted some k-means clustering on in R to have a look at how it clusters. I am wondering if there are any good tools in R to then analyse the genes within each cluster, or the next step from the k-means clustering? Each cluster has around 50 genes in them.

I've been thinking about investigating nearby transcription factors (mouse cells) however the closest R tool I can find for doing that is pwOmics. I know that the TFcheckpoint webtool is a thing, however I was hoping to avoid copy-pasting forty different sets of genes in there from text files. Are there any similar R tools?

The main thing is that I am a bit unsure where to go from here, as there are a lot of genes within each cluster, and I have about four different plots with about 10 clusters in each. Any suggestions would be super useful! Thanks

clustering rna-seq k-means R • 2.5k views
ADD COMMENTlink modified 3 months ago by Biostar ♦♦ 20 • written 2.1 years ago by MarjoryMollusc50
1

Let's say each cluster is one gene set. Taking each of this gene set you could do GO and pathway analysis to understand the biology. Simplest way to do this is DAVID. Further downstream analysis depends on what question you want to address.

ADD REPLYlink written 2.1 years ago by Chirag Parsania1.8k

Annotation enrichment analysis is a typical way of looking at clusters of genes. There are different R packages for this, e.g. Bioconductor topGO for GO terms enrichment or, in simple situations, you could do it with the fisher.test() function.

As an aside, you may want to have a look at this post about clustering time series.

ADD REPLYlink written 2.1 years ago by Jean-Karim Heriche22k
0
gravatar for Zhilong Jia
2.1 years ago by
Zhilong Jia1.6k
London
Zhilong Jia1.6k wrote:
  1. clusterProfiler. Visualization of profile comparison section from Bioconductor package clusterProfiler. A quick solution for your question. see figure.

  2. co-expressed gene set enrichment analysis, cogena. cogena started with gene expression not clustered genes, While if you want to try serveral clustering methods, including kmeans, and enrichment analysis of each cluster, cogena is recommended. Just put the gene sets gmt file in the installation directory of cogena, R/x86_64-pc-linux-gnu- library/3.2/cogena/extdata, (see vignette). see figure and figure

ADD COMMENTlink written 2.1 years ago by Zhilong Jia1.6k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1276 users visited in the last hour