Question

Manual annotation of cell types in single cell RNA-seq

1

2.7 years ago

Gene_MMP8 ▴ 240

I have recently started working with scRNA-seq data. I am following the tutorials by the creators of Seurat. In the final section titled "Assigning cell type identity to clusters", the authors mention that

Fortunately in the case of this dataset, we can use canonical markers to easily match the unbiased clustering to known cell types:

After this, they define a set of genes and the corresponding cell types and use that to annotate their dataset. I wanted to apply the same concept to a different scRNA-seq based breast cancer dataset. I derived the final list of clusters but got stuck while trying to manually annotate them. Basically, I have downloaded a list of markers (a matrix containing genes and cell types) from the Cell Marker database. I also have an expression matrix with rows as genes and columns as cells. Now how do I manually assign each cell to a particular cell type?

Seurat single_cell annotation • 4.3k views

ADD COMMENT • link updated 2.7 years ago by Friederike 8.9k • written 2.7 years ago by Gene_MMP8 ▴ 240

1

Manual annotation is a difficult task because it requires a clear idea of which cell types are present in your samples and which marker genes define them. If you're struggling with it, you could try automatic annotation instead (e.g. SingleR).

ADD REPLY • link 2.7 years ago by Martombo ★ 3.1k

0

The thing with SingleR is that I am not getting the cell types I expect given my data. For instance, I have breast cancer expression measurements, so I am expecting basal and luminal cell types. But SingleR is skipping those entirely for some reason.

ADD REPLY • link 2.7 years ago by Gene_MMP8 ▴ 240

0

I think you can supply your own reference dataset to SingleR to infer cell types. So in your case, you could use a public scRNA-seq dataset of breast cancer patients in which these subtypes were identified. Otherwise, you could use marker genes, if these are well described.

ADD REPLY • link 2.7 years ago by Martombo ★ 3.1k

0

There can be several reasons for that:

1. Your reference data set does not contain basal and luminal cell types.
2. Your target data set does not contain basal and luminal cell types.
2.1 Your target data set's basal and luminal cell types might be so "screwed up" that they are too dissimilar from the basal and luminal cell types in the reference data set.

SingleR will only work well if your reference data set encompasses the cell types that you have in your target data set. You could try to find an annotated single-cell reference data set from the literature that deals with very similar cell types.
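A quick way to rule out the first reason is to look at the labels the reference actually provides. The snippet below is only a sketch in base R: `ref_labels` is a made-up vector standing in for the label column of whatever reference you pass to SingleR, and `expected_types` holds the cell types you hope to recover.

```r
# Hypothetical labels, standing in for the cell type labels of a reference data set.
ref_labels <- c("T cells", "B cells", "Fibroblasts", "Endothelial cells", "T cells")

# Cell types you expect to find in your own data (assumption for this example).
expected_types <- c("Basal", "Luminal")

# How many reference cells/samples carry each label.
label_counts <- table(ref_labels)
print(label_counts)

# Case-insensitive substring match of each expected type against the reference labels.
found <- sapply(expected_types, function(x) any(grepl(x, ref_labels, ignore.case = TRUE)))

# Expected types with no matching reference label at all.
missing_types <- names(found)[!found]
print(missing_types)
```

If an expected type shows up in `missing_types`, SingleR has no way to assign it, whatever your target data look like.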

ADD REPLY • link 2.7 years ago by Friederike 8.9k

score 1 · Answer 1 · 2021-08-10

To manually annotate the clusters, you need to see which clusters express the markers.

library(Seurat)

endothelial_markers <- c("Pecam1") # mouse dataset

# plot expression
DefaultAssay(srtObject) <- "RNA"
# replace the placeholder string with the name of the metadata column holding your clustering
Idents(srtObject) <- "the column containing the clustering you want to use for the annotation"
FeaturePlot(srtObject, features = endothelial_markers) # expression overlaid on the embedding
DotPlot(srtObject, features = endothelial_markers) # useful when you are looking at subpopulations

In the plots produced by the example above, we can clearly localise the endothelial cells.
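If you have a whole marker table rather than a single gene, the same idea can be applied per cluster: average the expression of each marker set within each cluster and see which set scores highest. Below is a minimal base R sketch of that, not a Seurat function. The expression values, the cluster assignments and the marker table are all made up for illustration, and the long `gene`/`cell_type` layout is an assumption (a CellMarker download would probably need reshaping first).

```r
# Toy expression matrix: rows are genes, columns are cells (made-up values).
expr <- matrix(
  c(5, 6, 0, 0, 1, 0,   # KRT5
    4, 5, 1, 0, 0, 1,   # KRT14
    0, 1, 6, 5, 0, 0,   # KRT8
    0, 0, 5, 6, 1, 0,   # KRT18
    0, 0, 0, 1, 6, 5),  # PECAM1
  nrow = 5, byrow = TRUE,
  dimnames = list(c("KRT5", "KRT14", "KRT8", "KRT18", "PECAM1"),
                  paste0("cell", 1:6))
)

# One cluster label per cell, in the same order as the columns of expr.
clusters <- c("0", "0", "1", "1", "2", "2")

# Illustrative marker table in long format: one row per gene/cell type pair.
markers <- data.frame(
  gene      = c("KRT5", "KRT14", "KRT8", "KRT18", "PECAM1"),
  cell_type = c("basal", "basal", "luminal", "luminal", "endothelial")
)

# List of marker genes per cell type.
marker_sets <- split(markers$gene, markers$cell_type)

# Mean expression of one marker set per cell, then averaged within each cluster.
score_one <- function(genes) {
  genes <- intersect(genes, rownames(expr))         # ignore markers absent from the data
  per_cell <- colMeans(expr[genes, , drop = FALSE])
  sapply(split(per_cell, clusters), mean)
}

# Matrix of scores: rows are clusters, columns are cell types.
scores <- sapply(marker_sets, score_one)

# Label each cluster with the cell type whose marker set scores highest.
cluster_labels <- colnames(scores)[max.col(scores, ties.method = "first")]
names(cluster_labels) <- rownames(scores)

# Propagate the cluster labels back to the individual cells.
cell_labels <- unname(cluster_labels[clusters])
print(cluster_labels)
```

Treat the result as a starting point only. Picking the maximum always returns a label, even for a cluster that matches none of the marker sets, and raw means favour highly expressed genes, so it is worth scaling the data and checking the top-scoring clusters against a DotPlot before trusting the labels.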