What to do if normalization introduces this following artifact?
0
0
Entering edit mode
2.1 years ago

Hey,

I'm using Seurat on an scRNA-seq dataset with two groups of mice - A and B. All of what I write next is true regardless of whether I've used normalized counts from the SCT assay or the normalized counts (obtained using the "LogNormalize" method in the "NormalizeData" function) from the RNA assay. Anyway, here's the thing: pathway enrichment analyses with "clusterProfiler" - comparing group A to group B - suggest that (almost) only energy-related pathways are enriched in group A, while a diverse array of pathway types are enriched in group B. A look at the "leading edge" genes shows that the genes responsible for the enrichment of energy-related pathways in group A are highly expressed genes (in both groups), while the genes responsible for the enrichment of a variety of pathways in group B are expressed at substantially lower levels.

I'm worried about the following scenario: what if the number of transcripts per cell for non-energy-related genes is actually - in reality - similar between group A and group B, while the number of transcripts per cell of energy-related genes is indeed higher in group A than in group B? In such scenario, the normalization will make it look as if non-energy-related genes are upregulated in group B, while, in reality, they aren't.

If my worries are justified (i.e. if I'm not missing something), what can I do in order to alleviate the problem?

Cheers,

Omer

scRNA-seq Normalization • 595 views
ADD COMMENT
0
Entering edit mode
ADD REPLY

Login before adding your answer.

Traffic: 2307 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6