Question: Input genes for GSVA
gravatar for PR
2.5 years ago by
PR20 wrote:

Hello All,

I would like to know what exactly should be the set of genes that go as input in the expression matrix to GSVA. I am working particularly within the context of single-cell data, and have marker genes for two groups. I would like to find out the differential pathways between these two groups. First, I was thinking, since I have already identified a set of marker genes (about a 100 genes for each group), it would make sense to give the union of the two marker gene sets as the genes to include in the GSVA input. But now after reading more about how GSVA/GSEA work, I feel that the entire raw set of genes (about 14000 of them) should go as input so that the enrichment results would be stronger.

Is this true?

It would be great if somebody could explain what exactly should be the set of genes going into GSVA, is it better to give a restricted list or the entire list? Does "the more the merrier" apply here?

Thanks in advance for any responses!

P.S.: A while ago, I posted this related question, GSVA for single-cell marker genes but did not get any responses. So, I thought I will try again.

gsva gene expression • 1.8k views
ADD COMMENTlink modified 2.5 years ago by Kevin Blighe66k • written 2.5 years ago by PR20
gravatar for Kevin Blighe
2.5 years ago by
Kevin Blighe66k
Kevin Blighe66k wrote:

You may not have had any responses because enrichment is not a hot topic - in fact, the results of gene enrichments are frequently taken with a 'pinch of salt'. Certainly, you would never publish in any of the major journals with conclusions based solely on gene enrichment, even with GSVA.

In enrichment, one has to strike a balance between too much and too few genes. You would not include all genes in the enrichment, of course, but you would neither set your statistical thresholds for statistical significance too low. You will just have to go back and forth until you get results that make sense based on your previous hypotheses and the current knowledge on the disease that you are researching.


Edit 27 February 2019: some enrichment programs, like GSEA / GSVA, do take all of your genes and then determine which pathways / signatures are up or down from this.

ADD COMMENTlink modified 20 months ago • written 2.5 years ago by Kevin Blighe66k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1235 users visited in the last hour