Question

DEXSeq batch effects issue

0

Entering edit mode

6.9 years ago

ag1194 • 0

Hi I have a question about DEXSeq batch effect design. I followed the protocol from DEXSeq guideline to count in the batch effect.

featureID <- as.factor(data $featureID)  
groupID <- as.factor(data $gene_symbol)
design <- formula( ~ sample + exon + condition:exon )
formulaFullModel = ~ sample + exon + type:exon + condition:exon
formulaReducedModel = ~ sample + exon + type:exon
sampleTable = data.frame(row.names = c(paste0("num_",sample_3[1]),paste0("num_",sample_3[1+1]), paste0("num_",sample_6[1]),paste0("num_",sample_6[1+1])), condition = c("Control","Control", "Treatment","Treatment"), type = c("S","M"))

dxseq = DEXSeqDataSet(data1, sampleTable, design, featureID, groupID)
dxseq = estimateSizeFactors(dxseq)
dxseq = estimateDispersions(dxseq, formula= formulaFullModel)
dxseq = testForDEU(dxseq, reducedModel = formulaReducedModel, fullModel = formulaFullModel )
dxseq = DEXSeqResults(dxseq)

I also did DEXSeq without counting the batch effect. Without batch effect I've got more number of significantly regulated isoform than with batch effect design. According to their guideline, I suppose to observe more significant cases with the code above. I tried this method with different dataset and I always has less significant cases with the batch effect formula.

Looking forward to hear any explanation to understand the reason.

Thanks! -A

RNA-Seq DEXSeq • 1.6k views

ADD COMMENT • link 6.9 years ago by ag1194 • 0