Question: Batch Adjusted counts for Downstream Analysis
5 weeks ago by
saicharanp18 wrote:

Hi All,

I need some help with removing batch effects. I have 21 samples: 11 sequenced in batch A and 10 in batch B. Both batches contain both of the genotypes I want to compare.

  1. To perform downstream differential expression analysis, I need a batch-adjusted count matrix. How can I get one? I tried ComBat and limma's removeBatchEffect(), but both give negative values for zero counts.

  2. Can I use the batch-adjusted matrix as a count matrix and perform DEA on it?

Thanks in advance.
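
The negative values mentioned in point 1 are what you would expect from any adjustment that shifts each batch by a constant. Here is a toy sketch in plain Python that shows the effect. It is not what ComBat or limma actually compute (both fit models rather than simply centering raw counts); the function name `center_batches` and the counts are made up for illustration.

```python
from statistics import mean

def center_batches(values, batches):
    """Shift each batch so that its mean matches the overall mean.

    This is a location-only adjustment, used here only as a simplified
    stand-in for linear batch correction.
    """
    grand_mean = mean(values)
    batch_means = {
        b: mean([v for v, vb in zip(values, batches) if vb == b])
        for b in set(batches)
    }
    return [v - batch_means[b] + grand_mean for v, b in zip(values, batches)]

# Hypothetical counts for one gene: batch A has higher counts than batch B,
# and both batches contain samples with zero counts.
counts = [0, 20, 0, 20, 0, 2, 0, 2]
batches = ["A", "A", "A", "A", "B", "B", "B", "B"]

adjusted = center_batches(counts, batches)
print(adjusted)  # the zeros in batch A are shifted down to -4.5
```

Because batch A sits above the overall mean, every sample in it is shifted down by the same amount, zeros included. The adjusted values are no longer counts, which is why they are a poor fit for count-based DE tools.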

5 weeks ago by
segato.felipe wrote:


You could use ComBat to correct for batch effects, remove genes that are absent in most samples, and then run your DE analysis with a statistical test. We did this in our paper: (


Just out of interest, why did you use such a custom DEG methodology including t-tests instead of any of the established tools such as limma/edgeR/DESeq2?

written 4 weeks ago by ATpoint

Hi, thanks for the reply. Did you use ComBat-seq or the original ComBat? If you used the original ComBat, how did you deal with the negative values in the batch-adjusted counts? I had problems with ComBat on RNA-seq data, so I switched to the newer ComBat-seq, which preserves the count characteristics of the data.

Best, Sai

written 4 weeks ago by saicharanp18
4 weeks ago by
ATpoint wrote:

Typically one includes batch in the design, such as ~ batch + condition. Check e.g. the DESeq2 and edgeR manuals for this. This corrects for the baseline differences induced by batch, so there is no need to modify the counts themselves. Also, please search the web for this question: there are literally dozens of similar questions already on the Bioconductor support forum, and the developers of the standard DEG tools have commented on them extensively.
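
To illustrate what putting batch in the design does, here is a minimal sketch in plain Python using ordinary least squares on made-up log-expression values for one gene. It is only a linear analogue: DESeq2 and edgeR fit negative binomial GLMs to the raw counts, and the helper names (`solve`, `ols`) and the data are hypothetical.

```python
from statistics import mean

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        known = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - known) / m[i][i]
    return x

def ols(X, y):
    """Least-squares coefficients from the normal equations (X'X) beta = X'y."""
    p = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return solve(xtx, xty)

# Hypothetical unbalanced layout: batch A is mostly WT, batch B is mostly MUT.
samples = [("A", "WT")] * 3 + [("A", "MUT")] + [("B", "WT")] + [("B", "MUT")] * 3
# Simulated values: baseline 5, batch B adds 2, MUT adds 1 (no noise).
y = [5.0 + 2.0 * (b == "B") + 1.0 * (c == "MUT") for b, c in samples]

# Design matrix for ~ batch + condition: intercept, batch B indicator, MUT indicator.
X = [[1.0, float(b == "B"), float(c == "MUT")] for b, c in samples]
intercept, batch_effect, condition_effect = ols(X, y)

# Naive comparison that ignores batch.
naive = mean([v for v, (_, c) in zip(y, samples) if c == "MUT"]) - \
        mean([v for v, (_, c) in zip(y, samples) if c == "WT"])

print(f"naive MUT - WT difference: {naive:.2f}")
print(f"condition effect with batch in the model: {condition_effect:.2f}")
```

In this toy data the naive difference is inflated by the batch shift, while the model with a batch term recovers the simulated condition effect. The input values are never altered, which is the main advantage over adjusting the counts first.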


Thanks @ATpoint. I tried different DEG tools and adjusted for batch in the design formula. Our RNA-seq protocol is a little different from bulk RNA-seq, so I had to do some outlier analysis, for which I needed batch-adjusted normalized counts. I ended up using the newer ComBat-seq, which preserves the integer nature of count data.

Thanks, Sai

written 4 weeks ago by saicharanp18