RNA-seq normalization and standardization
1
0
Entering edit mode
6.0 years ago
hvphan • 0

Suppose I have a matrix of gene counts, whose rows represent genes, and columns represent samples. For RNA-seq packages such as edgeR and DEseq2, the counts are normalized and standardized before DE analysis.

My question is about the standardization itself. After normalizing each column to adjust for sequencing depth and library size, I calculate the log2(normalized counts + 1). Then, is each column further standardized (to get mean = 0 and standard deviation = 1), or is the standardization done on each row only?

Thank you for your help.

RNA-Seq • 2.7k views
ADD COMMENT
0
Entering edit mode

For RNA-seq packages such as edgeR and DEseq2, the counts are normalized and standardized before DE analysis.

You don't have to do anything to the counts, just give them raw to DESeq2/edgeR.

ADD REPLY
0
Entering edit mode

No need of standardization, just give raw data. And, Deseq2 won't accept even your normalized data.

ADD REPLY
0
Entering edit mode

I just want to understand what the packages do to the raw counts in the background.

Do these packages calculate a normalization factor (like what Damian Kao said), scale the counts by the factor and do DE test? Do they standardize the normalized counts before DE test?

ADD REPLY
1
Entering edit mode

Then you should probably read the paper and vignette

ADD REPLY
0
Entering edit mode
6.0 years ago

Packages like edgeR and DESeq2 don't actually transform the counts for DE analysis. A normalization factor is calculated and then used in the DE test. There are good information in the absolute counts for each gene. Standardization is not used, otherwise that information would be lost.

ADD COMMENT

Login before adding your answer.

Traffic: 916 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6