Question

Can someone please explain in simple terms how DESeq2 works?

13

Entering edit mode

9.3 years ago

BioStars1993 ▴ 130

As simply as possible please, and if you don't mind giving a quick definition of any specialist terms you use.

DESeq2 • 22k views

ADD COMMENT • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by BioStars1993 ▴ 130

0

Entering edit mode

Check the last link in my answer.

ADD REPLY • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by GouthamAtla 12k

11

Entering edit mode

9.3 years ago

GouthamAtla 12k

Look at posts by Simon Anders on Seqanswers. He explains in plain English about differential expression analysis in DESeq and also edgeR.

A few of the interesting discussions are here:

This is very nice link. I have been searching for it, just got it.

http://seqanswers.com/forums/archive/index.php/t-10797.html

ADD COMMENT • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by GouthamAtla 12k

1

Entering edit mode

9.3 years ago

karl.stamm 4.1k

The publication is the most complete explanation you can get.

In short: it estimates expression values, and calculates differential expression.

ADD COMMENT • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by karl.stamm 4.1k

7

Entering edit mode

Not sure that the publication qualifies for "as simply as possible" :)

ADD REPLY • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by Neilfws 49k

Ram · Accepted Answer · 2015-01-22

We really tried to write the main text of the paper such that it would be understood by non-statisticians. that said, I'll try to do it in a few sentences. before further questions on details please at least try to read the paper :)

Let's say we want to compare counts between two groups. We build a model for the observed counts. This model has some parameters: (1) a normalization parameter, for differences in library size at least, or it can be extended by other software; (2) a variance parameter, called dispersion; (3) parameters representing the group differences. Fit (1) using the same method from the original DESeq. Fit (2) in two steps: first find the value of the parameter that makes the likelihood largest, which is called maximum likelihood estimation. Look at all the values from all of the genes and move these values towards a middle value. Bayes theorem guides the amount of movement for each gene: if the information for the gene is low, the value is moved more to the middle, if the information for the gene is high, the value is moved very little. Fit (3) using the same technique as used for (2). The values for (3) are a useful final product, as are sets of genes where the group differences are likely to be above a threshold (zero or otherwise). These sets are defined by their false discovery rate.