Removal of mRNA contamination from scRNA-seq data using SoupX: How to proceed when "Extremely high contamination estimated"?
3.1 years ago
AA • 0

I am using SoupX to quantify, profile, and remove ambient mRNA contamination from my scRNA-seq data.

I ran the automated workflow on my dataset and got the following error:

library(SoupX)
sc <- load10X("/path/to/output")
sc <- autoEstCont(sc)
# 127 genes passed tf-idf cut-off and 35 soup quantile filter.  Taking the top 35.
# Using 252 independent estimates of rho.
# Estimated global rho of 0.75
# Error in setContaminationFraction(sc, contEst, forceAccept = forceAccept) : 
# Extremely high contamination estimated (0.75).  This likely represents a failure in estimating the contamination fraction.  Set forceAccept=TRUE to proceed with this value.

I'm very new to bioinformatics and scRNA-seq analysis and am wondering how to proceed. What should I do to check whether this estimate is "real" before moving on and correcting the expression profiles?

I've been trying some of the visual sanity checks mentioned in the vignette, but it seems I first need to use the "manual method" to estimate the contamination fraction. After reading through the vignette several times, I'm still unsure exactly what code I need to run. I keep running into the error "'x' must be an array of at least two dimensions".

scRNA-seq RNA-Seq SoupX • 1.1k views
