Question

RNA-seq normalization and standardization

0

6.7 years ago

hvphan • 0

Suppose I have a matrix of gene counts whose rows represent genes and whose columns represent samples. As I understand it, RNA-seq packages such as edgeR and DESeq2 normalize and standardize the counts before DE analysis.

My question is about the standardization itself. After normalizing each column to adjust for sequencing depth and library size, I calculate log2(normalized counts + 1). Is each column then further standardized (to mean = 0 and standard deviation = 1), or is the standardization done on each row only?

Thank you for your help.

RNA-Seq • 3.1k views


0

"For RNA-seq packages such as edgeR and DESeq2, the counts are normalized and standardized before DE analysis."

You don't have to do anything to the counts, just give them raw to DESeq2/edgeR.

6.7 years ago by WouterDeCoster 48k

0

There is no need for standardization; just supply the raw counts. DESeq2 won't even accept your normalized data.

6.7 years ago by k.kathirvel93 ▴ 310

0

I just want to understand what the packages do to the raw counts in the background.

Do these packages calculate a normalization factor (as Damian Kao described), scale the counts by that factor, and then run the DE test? Do they standardize the normalized counts before the DE test?

6.7 years ago by hvphan • 0

1

Then you should probably read the paper and the vignette.

6.7 years ago by WouterDeCoster 48k

score 0 · Answer 1 · 2018-10-26

0

6.7 years ago

Damian Kao 16k

Packages like edgeR and DESeq2 don't actually transform the counts for DE analysis. Instead, a normalization factor is calculated for each sample and then used inside the DE test, while the counts themselves stay raw. There is useful information in the absolute counts for each gene (the variability of a count depends on its magnitude). Standardization is not used, otherwise that information would be lost.
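
To make the "normalization factor" idea concrete, here is a minimal sketch in Python (NumPy) of a median-of-ratios calculation, the approach DESeq2 describes for its size factors. It is not the package's own code: the function name and the toy count matrix are made up for illustration, and edgeR uses a different method (TMM). The point is that the result is one factor per sample (column), and the raw count matrix is left untouched.

```python
import numpy as np

def median_of_ratios_size_factors(counts):
    """Return one size factor per sample for a genes x samples count matrix.

    Hypothetical helper for illustration only, not the DESeq2 implementation.
    """
    counts = np.asarray(counts, dtype=float)
    # Use only genes with a nonzero count in every sample, so logs are finite.
    expressed = np.all(counts > 0, axis=1)
    log_counts = np.log(counts[expressed])
    # Per-gene reference: log of the geometric mean across samples.
    log_geo_mean = log_counts.mean(axis=1, keepdims=True)
    # Per-sample factor: median over genes of the ratio to the reference.
    return np.exp(np.median(log_counts - log_geo_mean, axis=0))

# Toy data: sample 2 is sequenced twice as deeply as sample 1,
# sample 3 four times as deeply. The last gene has a zero and is skipped.
raw = np.array([
    [10, 20, 40],
    [100, 200, 400],
    [5, 10, 20],
    [0, 3, 7],
])

size_factors = median_of_ratios_size_factors(raw)
print(size_factors)

# Scaled counts are handy for plots, but they are not what the test is run on;
# the factors enter the count model alongside the raw counts instead.
scaled_for_display = raw / size_factors
```

Nothing here is centered to mean 0 or scaled to standard deviation 1, neither per column nor per row.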
