Data rescaling for a more informative heatmap?
2
1
Entering edit mode
2.4 years ago
j_weld ▴ 10

I have a 1001 x 42 matrix of values. Most of the values are 0, and for those non-zeros, most of them are big than 50, but the maximum value is 15474.

heatmap • 765 views
ADD COMMENT
1
Entering edit mode
2.4 years ago
Asaf 10k

I would remove uninformative columns if there are any and, most importantly, transform the numbers to log scale, it will be much easier to track changes this way.

ADD COMMENT
0
Entering edit mode

A lot better but still not very

ADD REPLY
0
Entering edit mode

Yes, read the link in my answer, that you know why. Heatmaps on just log2 cluster by expression level, not by difference. Try to remove uninformative genes as well.

ADD REPLY
0
Entering edit mode

It is a lot better in my view. Try also removing severe outliers. Most middle columns are almost completely uninformative though, as far as I can see, yet dominate the heatmap.

ADD REPLY
0
Entering edit mode
2.4 years ago
ATpoint 82k

Standardize the log2 expression values using the Z-score. Here is some hands-on code with data examples:

Scaling RNA-Seq data before clustering?

ADD COMMENT

Login before adding your answer.

Traffic: 1218 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6