Twice as many nFeature_RNA than nCount_RNA = more cell complexity? but how?
1
0
Entering edit mode
2.5 years ago

Hi,

I am re-analysing a publicly available single-cell RNA-seq dataset with two samples (plus minus treatment) and have downloaded preprocessed data from the geodataset as two .csv files. The authors state these files contain matrices that have been QC and logNormalized - and scaled.

After creating a Seurat object for both datasets, I checked the nFeatures_RNA and nCount_RNA for either dataset and got around twice as many nFeatures as nCounts_RNA. I can't explain this. To me UMIs are the nCount_RNA and I can't find anything on the internet proving otherwise. If nCount_RNA is UMIs, and there are only half the UMIs as genes detected, how can that many genes been detected? I believe that you can't have two RNA molecules from different genes detected by the same UMI. In other questions online, I have seen the definition of cell complexity log10(nFeature_RNA/nCount_RNA) is >0.8. Maybe it is my mathematical understanding that is failing me.

I attach a plot of the nCount_RNA against nFeatures_RNA and hope someone with a kind heart can explain how nFeature_RNA can be 2x that of nCount_RNA for a given cell. If it helps these cells should be endothelial cells from tumors.

Thank you in advance. /Maibrittenter image description here

complexity Seurat nCount_RNA nFeature_RNA scRNA-seq • 2.9k views
ADD COMMENT
1
Entering edit mode
2.5 years ago

How are you getting the nCount_RNA on data that's logNormalized and scaled? I don't think there is a way to determine that metric without the original counts, so however you're getting it is likely incorrect.

ADD COMMENT

Login before adding your answer.

Traffic: 2665 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6