I have RNA-seq for 62 samples from 4 groups sequenced in 2 known batches, with a clear batch-effect in the PCA. I've derived expression using FPKM, and I've used Combat on log2(FPKM + 1) values. This removes the batch effect successfully, but now I would like to analyze differential gene expression between the 4 groups.
My question/concern: I've read a few people saying that you shouldn't use these Combat-adjusted values for differential expression analysis, but I don't understand why. Can someone please comment on this or explain?