Question

SingleR for integrated scRNA-seq analysis

3

Entering edit mode

3.9 years ago

mvis1231 ▴ 120

Hi, I have data from a specific cell from mouse fed with a certain diet. I integrated 4 datasets that were measured at four different time for the integrated single-cell RNA seq analysis. I have been referring to the Seurat vignette : https://satijalab.org/seurat/v3.1/immune_alignment.html.

I am using SingleR to identify cell type for each cluster and I am wondering if I need to set DefaultAssay as "RNA" or "integrated". I tried both, but they gave me slightly different results for cell type identification.

Should I keep DefaultAssay as "RNA" or "integrated"?

Any thoughts and advice are greatly appreciated. Thank you.

scRNA-seq singler integrated Assay • 3.8k views

ADD COMMENT • link updated 2.9 years ago by FantasticAI ▴ 60 • written 3.9 years ago by mvis1231 ▴ 120

score 3 · Accepted Answer · 2020-06-08

3

Entering edit mode

3.9 years ago

igor 13k

The assay should be RNA, since SingleR expects expression values.

ADD COMMENT • link 3.9 years ago by igor 13k

1

Entering edit mode

Just to add => log2 expression values.

ADD REPLY • link 3.9 years ago by ATpoint 81k

1

Entering edit mode

Good point. I was just concerned with "RNA" vs "integrated".

ADD REPLY • link 3.9 years ago by igor 13k

0

Entering edit mode

Sorry for the silly question, what do you mean by log2 expression value? take log2 of data slot under the RNA assay? or the counts slot under the RNA assay?

ADD REPLY • link 2.9 years ago by FantasticAI ▴ 60

0

Entering edit mode

But we perform integrated assay when we trying to align cell states shared across datasets. If we will use 'RNA' slot what is the point of integration?

ADD REPLY • link 3.8 years ago by kinalimeric ▴ 40

1

Entering edit mode

Integration aims to create a common clustering landscape in which all cells are embedded. This makes it easy to compare cells which (without integration) would cluster based on batch effects, biological differences such as cell cycle, all kinds of sample-specific differences. The integration values must not be used (to my knowledge) with differential analysis methods since the process creates dependencies and notably changes magnitude and direction of changes. Therefore it probably should not be used with classifiers such as singleR which operates on Spearman correlation and therefore would suffer from changes in magnitude and direction of counts.

ADD REPLY • link 3.8 years ago by ATpoint 81k

1

Entering edit mode

To think of it another way: If you are trying to identify your cells based on independent datasets, it probably makes more sense to use the stable normalized values. The integrated values will change depending on the datasets you are integrating, which means your cell types will also change and that does not seem reasonable.

ADD REPLY • link 3.8 years ago by igor 13k

0

Entering edit mode

Under RNA assay, there are two slots, one is data, one is counts, which one should be used as SingleR input? I guess the counts slot is the raw counts, right? and the data slots are logNormalized data?

ADD REPLY • link 2.9 years ago by FantasticAI ▴ 60

1

Entering edit mode

SingleR expects the normalized counts, so you want the data slot.

ADD REPLY • link 2.9 years ago by igor 13k