PlotDispEsts returns this plot after running DESeq on my RNA Seq dataset. It looks somewhat different than example plots I see online - many of the points fall below the line of the expected dispersion for genes at a given expression level. I thought the line should be fitted to my data, but it doesn't look like a very good fit to me. Has anyone seen anything like this before? I'm new to RNA Seq analysis and am wondering if I should be concerned by this plot.
Thanks for your help!
After looking into it some more, I tried setting the DESeq parameter fitType=local, and my dispersion plot now looks like this. The curve certainly appears to fit the data more closely.
Could anyone give a description in layman's terms of the differences between the parametric fit (the default) and the local fit? Just want to make sure I'm proceeding with sound logic.
Thanks!
Can you add code and some context, so what are the samples, in terms of groups, and how many replicates, which design did you choose?
Sorry, just fixed the image link. I'm trying to compare RNA from exosomes in Disease vs Heathy. I have 12 biological replicates of each disease state. I created the counts using feature counts, and the design for the construction of my DESeq dataset was: ddsMatDisease=DESeqDataSetFromMatrix(countData = countdata, colData = coldata, design = ~ Disease)
Does that help?
Did you quantify via Salmon? I have seen a similar fits when using DESeq2 on transcript-level counts without aggregating them to gene level.