Question: Should I pre-normalize RNAseq data before combat?
gravatar for zhaoliang0302
12 months ago by
zhaoliang03020 wrote:

Hello, I want to use combat to remove batch effct. The format of rnaseq data is counts from RSEM. The documention of sva package said data should be normalized before using combat. So, how to normalize the data? I want to get counts after removing batch effect for subsequent differential analysis (DEseq/edgR). But DEseq and edgR needs raw counts as input, not TPM/FPKM. What should I do? Best, Zhao

ADD COMMENTlink modified 12 months ago by Kevin Blighe61k • written 12 months ago by zhaoliang03020
gravatar for Kevin Blighe
12 months ago by
Kevin Blighe61k
University College London
Kevin Blighe61k wrote:

What evidence do you have that a batch effect exists? I would avoid the use of ComBat. You can simply include batch as a covariate in your design formula and perform differential expression analysis in that way.

So, what I am saying is this: you do not have to use ComBat. If you believe that a batch effect exists, include the batch covariate in the design formula.

ADD COMMENTlink written 12 months ago by Kevin Blighe61k

Hello Kevin, Much thanks. I know what you said now. I plot PCA and found a signifiant difference between two batches. After your explanation, I won't use combat . Now the question is I don't know the expression data is count or FPKM, the official documentation is showed in the pictures pic1pic2 The maxima of the expression data is 14266614 and there are decimals. I don't know what kind of this data is. Can you help me?

ADD REPLYlink written 12 months ago by zhaoliang03020
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1310 users visited in the last hour