Question: Can EdgeR of DeSeq be used for Single-cell RNA-seq?
5
gravatar for neja
6.4 years ago by
neja50
Japan
neja50 wrote:

Hi all,

I've a single-cell rna-seq data and need identify DE genes across different cell groups. I would like to know can we use edgeR of DESeq for DE analysis of single-cells? If yes, then do we use the cells of the same group as replicates?

Thanks in advance!

 

rna-seq • 6.2k views
ADD COMMENTlink modified 6 days ago by ATpoint38k • written 6.4 years ago by neja50

Use monocle for single-cell RNA-seq

ADD REPLYlink modified 11 months ago by RamRS30k • written 6.4 years ago by Ming Tang2.6k

Thanks dariober for your response.

I'm aware of Monocle and am currently using it. But what I wanted to know is that can the standard RNA-seq tools be used for single-cell RNA-seq?Are there no specific requirements for single-cell rna-seq in terms of normalization etc, given that single-cell has the information of a single transcriptome whereas the standard rna-seq gives the average expression pattern of a cell-population.

ADD REPLYlink written 6.4 years ago by neja50
3
gravatar for ATpoint
4 months ago by
ATpoint38k
Germany
ATpoint38k wrote:

Edit (09/20): Also be sure to check out a new promising tool https://github.com/const-ae/glmGamPoi glmGamPoi estimates dispersion and fits the GLM to single-cell data conceptually similar to DESeq2/edgeR but with very notable gains in speed. It also implements an edgeR-like quasilikelihood ratio test, therefore could be a replacement for edgeR or DESeq2 on single-cell data that scales much better with the number of cells.


Refreshing this some years later, look at this benchmarking study which includes edgeR, DESeq2 and limma. https://www.nature.com/articles/nmeth.4612

Code for all tested pipelines is available: https://github.com/csoneson/conquer_comparison/tree/master/scripts

The key take-home messages from this paper for me were:

  • methods developed for bulk RNA-seq do not perform worse, often even better than dedicated single-cell methods.
  • Wilcox and T-Tests perform well overall but do not allow for complex designs so these are probably limited to simple two-group pairwise comparisons such as marker gene detection within clusters of the same dataset.
  • prefiltering of genes with low overall expression is beneficial for some methods such as edgeR. In the linked study the authors removed genes with a TPM below 1 in more than 25% of all cells.
  • including the cellular detection rate ("the fraction of detected genes per cell", original quote from the paper) into the design formula is beneficial for some methods such as edgeR
  • overall the edgeR QLF pipeline when filtering for lowly-expressed genes and including the cellular detection rate into the design ranks among the best tested methods in this setup

The code for the top-ranked edgeRQLFDetRate is here: https://github.com/csoneson/conquer_comparison/blob/master/scripts/apply_edgeRQLFDetRate.R

Still, I guess it strongly depends on the dataset how each method performs and we are still lacking gold standard benchmarking datasets with wetlab-confirmed true-positive and negative differential genes to robustly benchmark DE methods.

I personally prefer to aggregate clusters to pseudobulks and then run e.g. edgeR as a pseudobulk comparison, e.g. 2 vs 2 given that you have (here n=2 per group) experimental replicates for your scRNA-seq data. This is (in my hands) much more robust, based on meeting anticipated results and expected pathway enrichment. But this is only based on my dataset so not sure one can generalize.

This table ranks all tested tools and pipelines towards performance in this study with their datasets and their choice of parameters. I strongly suggest to explore things yourself on your data.

enter image description here

ADD COMMENTlink modified 6 days ago • written 4 months ago by ATpoint38k
1
gravatar for dariober
6.4 years ago by
dariober11k
WCIP | Glasgow | UK
dariober11k wrote:

I would say yes, you can look for differential expression from single cell RNA-Seq. In the end edger/DEseq don't care where your counts come from.

About your second question, it's up to you to decide which sets of libraries to contrast to look for DE. Comparing cells from one group vs cells from another group seems sensible if your are intersted in the differences between groups.

ADD COMMENTlink written 6.4 years ago by dariober11k

I found Deseq2 used in package Seurat

ADD REPLYlink written 23 months ago by linouhao0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1418 users visited in the last hour