Tool: DASC -- detecting hidden batch effects in gene expression datasets (Array/RNA-Seq/scRNA-Seq)
2.8 years ago by
United States
Ar930 wrote:

Update: Our manuscript has been published in Bioinformatics --

Abstract: We propose a data-adaptive, non-parametric, and non-regression approach that removes the biological signal to prepare the data for batch detection, and then applies a semi-NMF method to estimate the hidden batch factors associated with the samples. To isolate the batch signal, we use fusion penalties that shrink each individual expression profile towards the mean of its corresponding biological group in a non-parametric and data-adaptive manner. To ensure the stability of the estimated batch factors, we derive a consensus matrix by applying semi-NMF multiple times. Our approach has three major advantages over existing approaches:

  1. it estimates batch effects from the data,
  2. it makes no assumptions about the probability distribution of the data (no log transformation is needed, unlike svaseq), and
  3. it does not assume that all genes are affected by batch effects to the same degree.
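
To give a feel for the "apply semi-NMF multiple times and build a consensus matrix" step, here is a minimal Python sketch. It is not the DASC implementation (DASC is an R package). It uses the standard semi-NMF multiplicative updates on a genes x samples matrix and assumes, for illustration only, that each sample is assigned to the factor with the largest weight in each run. The function names `semi_nmf` and `consensus_matrix` are made up for this example.

```python
import numpy as np

def semi_nmf(X, rank, n_iter=200, seed=0):
    """Approximate X (genes x samples) as F @ G.T with G >= 0 and F unconstrained."""
    rng = np.random.default_rng(seed)
    G = rng.random((X.shape[1], rank)) + 0.1   # random positive start
    pos = lambda A: (np.abs(A) + A) / 2        # positive part of a matrix
    neg = lambda A: (np.abs(A) - A) / 2        # negative part of a matrix
    solve_F = lambda G: np.linalg.lstsq(G, X.T, rcond=None)[0].T
    eps = 1e-9                                 # avoids division by zero
    for _ in range(n_iter):
        F = solve_F(G)                         # least-squares update of F
        XtF, FtF = X.T @ F, F.T @ F
        numer = pos(XtF) + G @ neg(FtF)
        denom = neg(XtF) + G @ pos(FtF)
        G = G * np.sqrt((numer + eps) / (denom + eps))  # keeps G non-negative
    return solve_F(G), G

def consensus_matrix(X, rank, n_runs=20):
    """Fraction of runs in which each pair of samples gets the same label."""
    n = X.shape[1]
    C = np.zeros((n, n))
    for run in range(n_runs):
        _, G = semi_nmf(X, rank, seed=run)     # different random start per run
        labels = G.argmax(axis=1)              # assumption: hard assignment by largest weight
        C += labels[:, None] == labels[None, :]
    return C / n_runs
```

Entries of the consensus matrix close to 1 mean that two samples end up together in almost every run, so stable blocks in that matrix are candidates for hidden batches. Results depend on the random starts, which is why averaging over several runs helps.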

Tool: Bioconductor R package or GitHub source code

User Guide: How to use DASC

If you are interested in obtaining differentially expressed genes:

  1. Calculate the batch factors using DASC
  2. Use the estimated batch factor as a covariate in your DESeq2 model
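
In DESeq2 itself, step 2 usually means putting the batch factor into the design formula, e.g. `~ batch + condition`. The small Python sketch below only illustrates the idea of "batch as a covariate" with ordinary least squares on a single gene. It is not DESeq2's negative binomial model, and `fit_with_batch` is a hypothetical helper written for this example.

```python
import numpy as np

def fit_with_batch(y, condition, batch):
    """Fit y ~ intercept + condition + batch by ordinary least squares."""
    design = np.column_stack([np.ones(len(y)), condition, batch])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef  # [intercept, condition effect, batch effect]

# Toy data for one gene: 8 samples, two conditions, two batches (0/1 coded).
condition = np.array([0, 0, 1, 1, 0, 0, 1, 1], dtype=float)
batch = np.array([0, 1, 0, 1, 0, 1, 0, 1], dtype=float)
y = 1.0 + 2.0 * condition + 5.0 * batch       # noise-free toy expression values

print(fit_with_batch(y, condition, batch))     # approximately [1, 2, 5]
```

Because the batch term absorbs the batch shift, the condition coefficient reflects the biological effect only, which is the reason for including the estimated batch factor in the model.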

Manuscript is under preparation; will be out soon with all the comparisons to existing methods/tools (& with a lot more examples.)

modified 2.5 years ago • written 2.8 years ago by Ar930