which normalized counts is better from DESeq2
1
5
Entering edit mode
7.3 years ago
weixiaokuan ▴ 120

Hi,

I may have a silly question. How should I get normalized counts from DESeq2? If I understand it right, rlog and vst are both normalized counts which one is better? or Should I just use "counts(dds,normalized=TRUE)" instead of rlog or vst. Thank you.

-X

RNA-Seq DESeq2 • 12k views
0
Entering edit mode

Define "better".

0
Entering edit mode

Good point. Actually, I should say advantages vs dis-advantages of each method.

Also, if I want to use these normalized data to calculate fold change which one is close/same to the one calculated by the "results" function of DESeq2?

Thank you.

15
Entering edit mode
7.3 years ago

For things like calculating fold-changes, one would normally use the output of count(dds, normalized=T). The other two you mentioned and so much count but rather transformed counts. These are useful for things like making a heatmap, or PCA, or anything that involves clustering/imaging.

BTW, none of these will give you the same fold-change that DESeq2 does, because you're not using a prior distribution.

0
Entering edit mode

Devon,

Thank you for answering my questions. But I am just wondering why DESeq2 doesn't use rlog data matrix or vst data matrix to calculate the log2FC; but instead just using scaled (size factor) and normalized (dispersion) reads to calculate log2FC. I cannot find any reference for explaining such a practice. Do you have any insight on this?

-X

0
Entering edit mode

You have the question backwards. The question you should be asking is what would be gained by using heavily modified data for statistics rather than the raw data. Raw counts follow a negative binomial distribution, which is relatively easy to deal with. Truth be told, DESeq2 doesn't directly scale and normalize the counts, it just includes terms for them in its model (the scaling becomes a weight and dispersion is profiled out (I think edgeR does this too)). The details for this are in the DESeq2 paper.

0
Entering edit mode

I am curious to know, while obtaining normalized count from "count(dds, normalized=T)", Does design provided from folloiwing command has an effect on normalized count?

dds <- DESeqDataSetFromMatrix(countData = x, colData = ss.edesign, design = ~Condition)