Question

(Single-cell RNA seq; R - Seurat) Re-clustering after removing cells using SubsetData

3

5.1 years ago

vmallett ▴ 30

I have a question regarding re-clustering after removing cells using SubsetData (R - Seurat package).

I am analyzing a relatively large single-cell dataset (16 separate samples of ~5-10k cells each). In our first run of the Seurat pipeline, we ran multiCCA to align the 16 datasets and generate CCs. Subsequent analysis was performed on the ‘large Seurat’ object produced by multiCCA.

Next, we wanted to eliminate unwanted cell types/clusters from this dataset. To do so, I used SubsetData to remove several clusters, creating a new ‘large Seurat’ object. Because the dataset was generated using CCA, I can’t simply re-run FindVariableGenes and ScaleData and then re-cluster, since that would require running PCA.

In our case, would the next steps be:

1) Identifying the individual cell IDs that were removed with SubsetData
2) Locating those cells in the original (pre-CCA) ‘large Seurat’ files
3) Deleting the unwanted cells there
4) Repeating FindVariableGenes and ScaleData
5) Re-running CCA across the manually filtered 16 samples
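For concreteness, the five steps above could be sketched roughly as follows. This is only a sketch under several assumptions: it targets Seurat v2.x (where SubsetData, FindVariableGenes, ScaleData and RunMultiCCA exist), and the names `refilter_and_realign`, `sample_list`, `filtered` and `num_ccs` are hypothetical, introduced for illustration. It also assumes the cell names in the combined object match those in the per-sample objects; if prefixes were added when combining (for example via `add.cell.ids`), they would need to be reconciled first. Rather than listing the removed cells, it keeps the cells that survived the filtering, which amounts to the same thing.

```r
# Sketch only: assumes Seurat v2.x. All object and function names below
# other than the Seurat calls themselves are hypothetical.
library(Seurat)

# sample_list: named list of the 16 per-sample (pre-CCA) Seurat objects
# filtered:    combined object left after SubsetData() removed clusters
# num_ccs:     number of canonical components to compute (arbitrary default)
refilter_and_realign <- function(sample_list, filtered, num_ccs = 20) {
  # Steps 1-3: cell IDs that survived the cluster removal
  keep <- filtered@cell.names

  sample_list <- lapply(sample_list, function(obj) {
    # Assumes cell names are identical in the combined and per-sample objects
    obj <- SubsetData(obj, cells.use = intersect(obj@cell.names, keep))
    # Step 4: recompute variable genes and rescale on the reduced cell set
    obj <- FindVariableGenes(obj, do.plot = FALSE)
    ScaleData(obj)
  })

  # One simple choice of genes: the union of variable genes, restricted to
  # genes that are present in the scaled data of every sample
  genes_use <- unique(unlist(lapply(sample_list, function(obj) obj@var.genes)))
  for (obj in sample_list) {
    genes_use <- intersect(genes_use, rownames(obj@scale.data))
  }

  # Step 5: re-run multi-set CCA on the filtered samples
  RunMultiCCA(sample_list, genes.use = genes_use, num.ccs = num_ccs)
}
```

The returned object would then go through the same alignment and clustering steps as in the first pass.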

Is there a better/simpler way to go about doing this?

Thank you!

RNA-Seq R Seurat Single-cell • 5.0k views

ADD COMMENT • link updated 21 months ago by Kusunoki • 0 • written 5.1 years ago by vmallett ▴ 30

0

Hi vmallett, I am currently encountering the same situation. How did you finally finish your analysis?

ADD REPLY • link 21 months ago by Kusunoki • 0