Question: Figures generated from RNAseq
gravatar for kcavagnero
2.0 years ago by
kcavagnero0 wrote:

Hi all,

I want to preface this question by saying that I am a complete noob when it comes to rna seq data analysis and bioinformatics in general. I had some sequencing done by my university's core for 8 samples - 4 conditions with 2 replicates each, and I would like to show with figures the similarities between the four conditions and also the differences. Does anyone have any input on the best way to do this? I am thinking a PCA plot, heat map with all of the genes that are similar, and a heat map with all that are different. Does this sound like a reasonable approach? As far as I am concerned, heat maps would only show the fold change, but I think it would also be informative to show the absolute expression -- I am wondering if maybe not heat plots and instead use simple bar graphs with TPM values? Lastly, I am in a bit of a rush to get some figures made for a manuscript and if you have any recommendations as to how to do this in a quick, cost-effective, user-friendly web-based manner, please let me know. Any help would be much appreciated!

Thank you in advance!


rna-seq • 646 views
ADD COMMENTlink modified 2.0 years ago by Friederike6.8k • written 2.0 years ago by kcavagnero0

Follow griffith's tutorial on RNAseq data analysis: Not web based, but easy to follow. Featurecounts-DEseq2 tutorial can be followed from the blog:

ADD REPLYlink written 2.0 years ago by cpad011215k

are you comfortable with using R?

ADD REPLYlink written 2.0 years ago by Friederike6.8k

Nah. Trying to learn now, but not sure I'll have time as my boss wants this out asap.

ADD REPLYlink modified 2.0 years ago • written 2.0 years ago by kcavagnero0

I guess, then you should follow Lior Pachter's advice on how to write a paper in five minutes highlighting the Maya'an's Lab BioJupies Notebooks. If your experiment doesn't have a very complex experimental design, that may help.

ADD REPLYlink modified 2.0 years ago • written 2.0 years ago by Friederike6.8k
gravatar for Friederike
2.0 years ago by
United States
Friederike6.8k wrote:

Usually, you would start from a matrix of read counts (integers). If you import those into R, there are numerous packages that will allow you to achieve the kinds of plots you envision (and more).

Generally, you would:

  1. normalize the counts for differences in sequencing depth between the samples
  2. possibly account for the dependence of the variance on the mean, e.g. using DESeq2's rlog function

The easiest way to generate the plots you're after would then be the pcaExplorer package.

For more detailed code and explanations, you can look at Chapter 5 of our course material.

For determining logFC, you should definitely make use of the DESeq2 (or limma or edgeR) package as they will try to model the gene counts with fairly sophisticated approaches to get fairly robust results (that would be chapter 6 in the course notes).

ADD COMMENTlink written 2.0 years ago by Friederike6.8k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1772 users visited in the last hour