Unequal sample sizes Deseq2
1
0
Entering edit mode
4.1 years ago
Payal ▴ 160

Hi,

I have two groups with 3 replicates each for differential gene expression analysis.

Control - Sample1 - Sample2 -Sample3

Experiment -Sample1 -Sample2 -Sample3

However, Sample3 in Control has high duplication rate almost double reads than other samples. For differential gene expression in Deseq2, can I exclude the Sample3 in control and compare Controls (Sample1, Sample2) vs Experiment (Sample1, Sample2, Sample3). Will Deseq2 library size normalization take care of unequal sample sizes?

When I am doing DGE with n of 3 I get really low number of significant DGEs (Differentially expressed genes). But when I do n of 2 I get much higher DEGs. Or should I report differential expression results for n of 2 only. Is n of 2 statistically significant?

Thanks, Payal

RNA-Seq sequencing • 2.5k views
ADD COMMENT
1
Entering edit mode
4.1 years ago
ATpoint 82k

Yes, that is fine. Unequal sample sizes are not unusual. You can check by PCA (plotPCA after transforming data with vst) if the suspicious sample indeed shows evidence for being an outlier. n=2 is of course far from perfect but if the 3rd samples does more harm than good due to being of low quality then yes, exclude it.

ADD COMMENT
1
Entering edit mode

Thanks a lot. I did plotPCA with vst and there were clear batch effects for n of 3. After removing the bad sample and performing DEG with n of 2, I again did plotPCA and then I saw clear PCA plots.

ADD REPLY

Login before adding your answer.

Traffic: 1560 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6