Question

Difference between number of cells predicted by two different versions of cellranger

0

Entering edit mode

2.1 years ago

Nitin Narwade ★ 1.6k

Hii,

I am trying CellRanger for my single-cell transcriptomic data (3’ Gene Expression v2 Libraries). I run the analysis on this data using two different versions of CellRanger v2.2.0 and v6.1.0, the number of cells in both analyses are drastically different. with v2.2.0 we are getting an average of ~8000 cells whereas with the new version we are getting ~11000.

What could be the possible reason behind this and which version should I use?

Any suggestion will be greatly appreciated.

Regards, Nitin N.

Cellranger scRNA-Seq • 1.4k views

ADD COMMENT • link updated 2.1 years ago by jared.andrews07 ★ 16k • written 2.1 years ago by Nitin Narwade ★ 1.6k

1

Entering edit mode

You can go through the release notes at https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/release-notes (see red box at lower-right corner to switch versions) to see what changed in the individual versions.

ADD REPLY • link 2.1 years ago by ATpoint 82k

score 4 · Accepted Answer · 2022-03-29

4

Entering edit mode

2.1 years ago

jared.andrews07 ★ 16k

This is (likely) primarily due to the swap to using a version of the EmptyDrops algorithm for cell calling in version 3.0.

Cell Calling Changes Cell Ranger 3.0 implements a version of the EmptyDrops cell calling algorithm that will call more low RNA content cells, especially when they are mixed with a population of high RNA content cells. See Cell Calling Algorithms for details. The cell calling 'knee-plot' in the web summary now indicates what fraction of barcodes in each segment of the curve were called as cells, since the new cell calling algorithm no longer makes a hard threshold on UMI counts.

ADD COMMENT • link 2.1 years ago by jared.andrews07 ★ 16k

0

Entering edit mode

Thanks, Jared,

I don't know how I missed this while reading. Probably I checked the major changes only for CellRanger v2.2.0 and v6.1.0.

Thank you for the information. So do you have any suggestions for removing the low RNA content cells from the downstream?

BTW, I am using Seurat for the downstream analysis. Currently, I am using this code for the initial QC (following this tutorial).

filtered_seurat <- subset(x = merged_seurat, 
                     subset= (nUMI >= 500) &  # number of UMI per cell
                       (nGene >= 500) &   # minimum number of genes per cell
                       (nGene <= 4000) & # maximum number of genes per cell
                       (log10GenesPerUMI > 0.85) & # complexity ratio: 
                       (mitoRatio < 0.05) & 
                       (doublet_status == FALSE))

Do you have any suggestions on this to remove cells with low RNA content?

Thanks and Regards,

Nitin N.

ADD REPLY • link 2.1 years ago by Nitin Narwade ★ 1.6k

1

Entering edit mode

Well, what you've got there will already mostly do that, though arbitrary thresholds run their own risks.

ADD REPLY • link 2.1 years ago by jared.andrews07 ★ 16k