Question: DESeq2 Dispersion Estimates Curve Poor Fit?
gravatar for lransom
5 weeks ago by
lransom0 wrote:

Plot Disp Ests PlotDispEsts returns this plot after running DESeq on my RNA Seq dataset. It looks somewhat different than example plots I see online - many of the points fall below the line of the expected dispersion for genes at a given expression level. I thought the line should be fitted to my data, but it doesn't look like a very good fit to me. Has anyone seen anything like this before? I'm new to RNA Seq analysis and am wondering if I should be concerned by this plot.

Thanks for your help!

rna-seq • 189 views
ADD COMMENTlink modified 4 weeks ago by ponganta50 • written 5 weeks ago by lransom0

Can you add code and some context, so what are the samples, in terms of groups, and how many replicates, which design did you choose?

ADD REPLYlink modified 5 weeks ago • written 5 weeks ago by ATpoint46k

Sorry, just fixed the image link. I'm trying to compare RNA from exosomes in Disease vs Heathy. I have 12 biological replicates of each disease state. I created the counts using feature counts, and the design for the construction of my DESeq dataset was: ddsMatDisease=DESeqDataSetFromMatrix(countData = countdata, colData = coldata, design = ~ Disease)

Does that help?

ADD REPLYlink written 5 weeks ago by lransom0


After looking into it some more, I tried setting the DESeq parameter fitType=local, and my dispersion plot now looks like this. The curve certainly appears to fit the data more closely.

Could anyone give a description in layman's terms of the differences between the parametric fit (the default) and the local fit? Just want to make sure I'm proceeding with sound logic.


ADD REPLYlink written 5 weeks ago by lransom0

Did you quantify via Salmon? I have seen a similar fits when using DESeq2 on transcript-level counts without aggregating them to gene level.

ADD REPLYlink modified 4 weeks ago • written 4 weeks ago by ponganta50
gravatar for Frederik Ziebell
4 weeks ago by
Frederik Ziebell10 wrote:

The local dispersion fit fits a function of the form f(x) = a/x + b to the mean-of-counts vs. dispersion point cloud (equation 6 in the DESeq2 paper), while the local fit uses the locfit package to compute a moving average and is thus more flexible to fit mean-dispersion trends that do not fit the parametric form described above. From looking at quite a few mean-dispersion trends, yours looks quite strange.

It's probably best to ask this question in the Bioconductor forum and use the DESeq2 tag to ask Mike Love himself what he thinks of your dispersion trend. Have you filtered the data appropriately as suggested here?

ADD COMMENTlink modified 4 weeks ago • written 4 weeks ago by Frederik Ziebell10
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 954 users visited in the last hour