I am using Seurat to analyze scRNA-seq. I used a function AggregateExpression() for pseudobulk analysis. But I suddenly remembered this function give me a biased result, because expression of the result may be proportionate to the number of cells(cell counts). AggregateExpression() returns the sum of gene expression.
I am analyzing T cells. I could get highly expressed genes in CD8 T effector mermory and CD8 T exhausetd (these subtypes have high proportion in my T cell data). I suppose this result come from cell counts. Is this right? If so, how can I treat pseudobulk?
- I am worried that celltype1 and celltype2 have similar expression in single cell level, but have high difference of expression in bulk level because of cell counts not as other reason.
Thanks a lot for reading my question.
Hi, Did you figure this out? In my data some gene expressed in very low number of cells (less than 1% of total cells) appear as differentially expressed after pseudobulk. Did you do any filtering for the total number of cells before/after aggregation or pseudobulk differential expression analysis?