Question

scRNAseq analysis with a few samples (n=2 per group)

0

Entering edit mode

20 days ago

Song ▴ 10

Hi all,

I was performing scRNA-seq analysis with only a few samples (n = 2 per group). This was due to financial limitations in our lab. However, I was wondering if I could still perform reliable differential gene expression (DEG) analysis and proportional comparison analysis with these data. It seems that at least three samples per group are required for reliable statistical results.

The total number of cells is sufficient (> 60,000).

Any suggestions would be appreciated.

Sincerely Song

DEG analysis scRNA-seq samples • 754 views

ADD COMMENT • link updated 20 days ago by ATpoint 90k • written 20 days ago by Song ▴ 10

score 3 · Accepted Answer · 2025-10-23

3

Entering edit mode

20 days ago

ATpoint 90k

Your n=2 is enables pseudobulk aggregation, and after that you can run typical bulk testings such as DESeq2 or limma. That is in my opinion and experience almost always better than treating individual cells as replicates. Basically, you would sum up the counts of all cells per sample and group/cluster/celltype/whatever, and then go down the normal bulk testing regime. Does that make sense to you?

ADD COMMENT • link 20 days ago by ATpoint 90k

0

Entering edit mode

That makes perfect sense. Thank you for clarifying, your explanation was very helpful.

ADD REPLY • link 20 days ago by Song ▴ 10

0

Entering edit mode

can we also create fake pseudobulks (to mimic biological replicates) from one single cell data by doing random cell sampling and then make the comparison with deseq ? in that case how can i understand how many fake pseudobulks that i need for enough statistical power ?

ADD REPLY • link 20 days ago by carolofharvest ▴ 50

1

Entering edit mode

In theory you can, but as you say, it's made up aka fake, so what's the point? You have two samples. Two vs two is not super powerful, but at least it's real. I mean, after all if you get no DEGs at all with that youn can (for hypothesis generation) lower the FDR cutoff, or fall back to something like geneset testing rather than strict per-gene analysis. It's all better than making up data.

ADD REPLY • link 20 days ago by ATpoint 90k