Question

DESeq2 with 2 factors: problem to fit the negative binomial and weird p-value distributions

9.0 years ago

Aurelie MLB ▴ 360

Hello,

I have RNA-seq data from 27 samples to analyse. I have 2 different treatments (Compound1 and Compound2) plus a control (DMSO), and 3 time points, with 3 replicates for each combination.

I am trying to use DESeq2 to find the differentially expressed genes. First, may I double-check with you that the formula ~treatment*time looks OK? (I found ~cell + dex at http://www.bioconductor.org/help/workflows/rnaseqGene/ and I am confused: I am wondering why one would use that formula. Is there something special I should know about DESeq2 formulas?)

So, using a pooled analysis with all the samples at once and the design ~treatment*time:

I plotted the dispersion against the mean of normalized counts (using plotDispEsts(dds)). I am not sure what the red line in this plot is, because I thought the parameters were calculated per gene using the replicates... But for low mean normalized counts (where the dispersion is larger), this red line is well above both the black points and the shrunken blue points. Is this a sign of a problem?
I also looked at the p-value plots for different comparisons, and some of them are not flat at all. The counts seem to increase slowly from the low p-values to the high p-values, with some spikes around 0.4 or 0.8. Is this typical for DESeq2, or is it a sign of a problem?

I thought that these odd observations came from high variability between time points, so I tried a similar analysis per time point (formula: ~treatment). My problem is that doing so gives very different lists of differentially expressed genes...
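To make the two set-ups concrete, here is a minimal base-R sketch of the design matrices they imply. It does not run DESeq2 at all; it only uses model.matrix() to show which coefficients each formula asks for. The sample table and the level names (T1, T2, T3, and the object names coldata, mm_full, mm_add, mm_t1) are placeholders assumed for illustration, not taken from the real data.

```r
# Hypothetical sample table: 3 treatments x 3 time points x 3 replicates = 27 samples.
# Level names are assumptions made for this illustration only.
coldata <- expand.grid(
  replicate = 1:3,
  time      = c("T1", "T2", "T3"),
  treatment = c("DMSO", "Compound1", "Compound2"),
  stringsAsFactors = FALSE
)

# Put the control (DMSO) and the first time point first so they act as reference levels.
coldata$treatment <- factor(coldata$treatment,
                            levels = c("DMSO", "Compound1", "Compound2"))
coldata$time <- factor(coldata$time, levels = c("T1", "T2", "T3"))

# Pooled design with interaction: ~treatment*time is R shorthand for
# ~treatment + time + treatment:time.
mm_full <- model.matrix(~ treatment * time, data = coldata)

# Pooled design without interaction, analogous in shape to ~cell + dex.
mm_add <- model.matrix(~ treatment + time, data = coldata)

# Per-time-point design: keep only the samples of one time point.
coldata_t1 <- droplevels(coldata[coldata$time == "T1", ])
mm_t1 <- model.matrix(~ treatment, data = coldata_t1)

# Inspect the coefficient names and the size of each design.
print(colnames(mm_full))
print(dim(mm_full))
print(dim(mm_add))
print(dim(mm_t1))
```

In this sketch the interaction design has 9 coefficients fitted on 27 samples, the additive design has 5, and the per-time-point design has 3 coefficients fitted on only 9 samples. The pooled and per-time-point analyses therefore fit different models on different amounts of data, which may be one reason the gene lists differ.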

I am not sure which one is the best analysis really... any suggestion please?

Many thanks

DESeq2 RNA-Seq • 3.2k views

updated 21 months ago by Ram 43k • written 9.0 years ago by Aurelie MLB ▴ 360

Ram · Answer 1 · 2015-04-29

9.0 years ago

Michael Love ★ 2.6k

We discourage cross posting the same question on multiple sites because it duplicates everyone's effort in answering your questions. At the least, please link to the Bioc support site posts.

Apologies! I did not know. I posted here first, then saw that the Bioconductor support website was recommended in your documentation, so I thought it would be more appropriate to post there in the end. All I can say now is that it will not happen again...

The Bioconductor post, with your answer, is here: https://support.bioconductor.org/p/67132/

updated 21 months ago by Ram 43k • written 9.0 years ago by Aurelie MLB ▴ 360