Question: Best values to use for a heatmap comparing conditions comprised of biological replicates?
gravatar for kelgalla
6.6 years ago by
United States
kelgalla0 wrote:

I have some RNASeq data, and for the samples, I have counts, normalized counts (RPKM), or variance stabilized counts from DESeq.  I have used DESeq on the various conditions to identify differentially expressed genes, and so I also have the baseMean for each condition.  It seems you can create a heatmap of the samples using counts, normalized counts, or variance stabilized counts, though the variance stabilized counts are likely are the best.

However, if I wanted to make a heatmap where I summarize the samples into their conditions (treat all biological replicates as one condition), then what are the best values to use?  Do I use the baseMean, the mean of the variance stabilized counts, or the mean of the RPKM values?

Because one of the conditions is a control, and all other conditions are being compared to this same control, it is also possible that I might use the fold change, log2 fold change, or the modified log2 fold changes as described by DESeq?

heatmap clustering deseq rnaseq • 6.2k views
ADD COMMENTlink modified 6.5 years ago by smjazayeri20 • written 6.6 years ago by kelgalla0
gravatar for Michael Dondrup
6.6 years ago by
Bergen, Norway
Michael Dondrup48k wrote:

You could do a multidimensional scaling plot (MDS) on all samples and compare all the different methods, e.g. using the plotMDS function (limma, edgeR). Which MDS plot conveys most biological relevance in sample pairings? I bet on log2 fold change or log2 library-size normalized abundances (CPM). In my experience taking log is essential.  

ADD COMMENTlink modified 6.6 years ago • written 6.6 years ago by Michael Dondrup48k
gravatar for Ming Tang
6.6 years ago by
Ming Tang2.6k
Houston/MD Anderson Cancer Center
Ming Tang2.6k wrote:

you can use any value ( counts normalized to library size, variance stabilized counts, RPKM, log2 Fold change etc....) for making heatmap. heatmap is just using color to represent numbers. for control and treated samples, you may selected genes that are significantly changed (up or down based on adjust-pvalue) and then got the person-correlation distance matrix and make heatmap by heatmap.2.

ADD COMMENTlink written 6.6 years ago by Ming Tang2.6k
gravatar for smjazayeri
6.5 years ago by
smjazayeri20 wrote:

 To show the results in Excel you can easily plot your RT-qPCR vs RNA-Seq using Log2FoldChange (I think this value is better as DESeq gives you foldchange value more than 0 (between 0 and 1 means underexpression and you have to change it to negative value) so it is not possible to use FoldChange to show transcript abundance toward negative values).

 For heatmap you can use MapMan/PageMan for each organism you are working just do a blast with one of the organisms with which your under study organism has close relation in order to have the gene IDs (e.g. for plants use Arabidopsis o Oryza sativa).

Of course for heatmap, you can find heatmap3 as recently it is published.

Best regards

ADD COMMENTlink modified 23 months ago • written 6.5 years ago by smjazayeri20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2132 users visited in the last hour