Question

Intuitive explanation of GSVA analysis

1

Entering edit mode

6.1 years ago

arronar ▴ 280

I'm trying to understand the way the GSVA analysis is working behind the scenes.And I was wondering if there is any way to understand it more intuitively the whole process.

So at first according to paper it starts by evaluating whether a gene i is highly or lowly expressed in sample j in the context of the sample population distribution. They use these kernel estimations of the cumulative density functions to transform the initial values so not to be affected by the problematic intensities.

After this "transformation" and a following normalization, GSVA calculates the enrichment scores using the Kolmogorov-Smirnov (KS) like random walk statistic.

As I know, Kolmogorov-Smirnov checks for differences in distributions. Which distributions does it check? Gene-set's against all the others genes? And what is the role of the random walk?

So is there any intuitive way to understand this kind of Kolmogorov-Smirnov (KS) like random walk statistic? How does it actually work? Which one is the null and which the alternative hypothesis in that case?

microarray GSVA understanding • 2.8k views

ADD COMMENT • link updated 5.1 years ago by Pietro ▴ 230 • written 6.1 years ago by arronar ▴ 280

score 0 · Answer 1 · 2019-03-14

0

Entering edit mode

5.1 years ago

Pietro ▴ 230

You may want to give a look at this https://towardsdatascience.com/decoding-gene-set-variation-analysis-8193a0cfda3

ADD COMMENT • link 5.1 years ago by Pietro ▴ 230