Question: Can I run EdgeR or DESeq2 on bimodal data caused by batch effect?
gravatar for jeltje.van.baren
10 weeks ago by
jeltje.van.baren80 wrote:

I have counts data for 18 samples, two conditions, sequenced in two batches. I would normally run DE analysis with batch as covariate.

A density plot on the scaled CPM data shows a bimodal distribution. Density plots of the separated batches show single peak more-or-less normal distributions so my conclusion is that the bimodality is caused by the batches.

I know that for running batch correction using ComBat you need to select non-parametric correction if the distribution isn't normal, because the regular parametric method assumes a normal distribution. I also know you're supposed to run DE on the raw counts without corrections so I can't use the batch corrected data. What I do not deeply understand is how DE programs subtract out the covariates and if they need a normally distributed dataset to do so.

So my question is: Can I run EdgeR or DESeq2 with batch as covariate the way I normally do? Or will the bimodality cause issues?

rna-seq • 195 views
ADD COMMENTlink written 10 weeks ago by jeltje.van.baren80

Hello, could you please add the code and resulting figure to your post, it is difficult to follow just based on textual description. Thanks.

ADD REPLYlink modified 10 weeks ago • written 10 weeks ago by ATpoint46k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2198 users visited in the last hour