Why it is recommended to run samples from all groups together while performing DESeq although the design variables are not used when estimating the size factors?
1
0
Entering edit mode
19 months ago
Amr ▴ 160

Why it is recommended to run samples from all groups together while performing DESeq although the design variables are not used when estimating the size factors?

DESeq2 GSEA R DEG • 674 views
ADD COMMENT
0
Entering edit mode

Gene variance is estimated using all samples, so the more samples you have the more accurate the estimate.

ADD REPLY
0
Entering edit mode

Do you mean that DESeq2 uses a specific measure of dispersion (α) related to the mean (μ) and variance of the data: Var = μ + α*μ^2. Based on the dispersion is higher for small mean counts and lower for large mean counts so have more samples is better? did I understand well? Thanks

ADD REPLY
0
Entering edit mode
19 months ago

The recommendation has to do with ensuring that no additional external factors affect the sequencing process.

Minor changes in the protocol, sequencing efficiency, library preparation, temperature of the day, person doing the sequencing can introduce "systematic errors". Unlike random errors, systematic errors do not get easier to correct with increasing sample sizes. The opposite is true in fact. Systematic errors get to become more relevant with increasing sample sizes.

The more samples and the more data is collected the tinier systematic errors can show up as true signal.

ADD COMMENT

Login before adding your answer.

Traffic: 1251 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6