Question

Twice as many nFeature_RNA than nCount_RNA = more cell complexity? but how?

0

Entering edit mode

2.5 years ago

MaibrittMardahl • 0

Hi,

I am re-analysing a publicly available single-cell RNA-seq dataset with two samples (plus minus treatment) and have downloaded preprocessed data from the geodataset as two .csv files. The authors state these files contain matrices that have been QC and logNormalized - and scaled.

After creating a Seurat object for both datasets, I checked the nFeatures_RNA and nCount_RNA for either dataset and got around twice as many nFeatures as nCounts_RNA. I can't explain this. To me UMIs are the nCount_RNA and I can't find anything on the internet proving otherwise. If nCount_RNA is UMIs, and there are only half the UMIs as genes detected, how can that many genes been detected? I believe that you can't have two RNA molecules from different genes detected by the same UMI. In other questions online, I have seen the definition of cell complexity log10(nFeature_RNA/nCount_RNA) is >0.8. Maybe it is my mathematical understanding that is failing me.

I attach a plot of the nCount_RNA against nFeatures_RNA and hope someone with a kind heart can explain how nFeature_RNA can be 2x that of nCount_RNA for a given cell. If it helps these cells should be endothelial cells from tumors.

Thank you in advance. /Maibritt enter image description here

complexity Seurat nCount_RNA nFeature_RNA scRNA-seq • 2.9k views

ADD COMMENT • link updated 2.5 years ago by jared.andrews07 ★ 16k • written 2.5 years ago by MaibrittMardahl • 0

score 1 · Answer 1 · 2021-10-18

1

Entering edit mode

2.5 years ago

jared.andrews07 ★ 16k

How are you getting the nCount_RNA on data that's logNormalized and scaled? I don't think there is a way to determine that metric without the original counts, so however you're getting it is likely incorrect.

ADD COMMENT • link 2.5 years ago by jared.andrews07 ★ 16k