How single sample GSEA works
0
1
Entering edit mode
6.3 years ago
sugus ▴ 150

Hi guys,

I got confused to interpret the result derived by single sample gene set enrichment analysis.

To be specific, I have the expression data for a specific pathway and show it in a heatmap by log and z-score transformation of the FPKM matrix for each gene. See the heatmap as the following:

heatmap

I further compressed the signature into a single score per sample by ssgsea algorithm using GSVA R package. by the following command:

enrichscore.meta <- gsva(as.matrix(FPKM.HUGO),meta.sig,method="gsva")

The enrichment score was then z-score and truncated to [-1,1], and shown in a scoremap. See the scoremap as the following: scoremap

You could see in the heatmap, under the first red subtype, there are two blocks with either upregulated (red rectangle) or downregulated (blue rectangle) expression, and the number of these dysregulated genes is almost 50-50.

However, how could I get a corresponding scoremap with totally blue score in the first subtype (black rectangle) which means this subtype is generally downregulated in this pathway (signature), as compared to other subtypes presented with relative high scores. It doesn't make any sense.

Is there anything wrong, and I hope to get some advises.

Many thanks advanced!

ssgsea RNA-seq R GSVA • 11k views
ADD COMMENT
0
Entering edit mode

What are you enriching your data to?; what is contained within the object meta.sig? This is important to understand.

However, how could I get a corresponding scoremap with totally blue score in the first subtype (black rectangle) which means this subtype is generally downregulated in this pathway (signature), as compared to other subtypes presented with relative high scores. It doesn't make any sense.

You can 'fix' the order of your heatmap. How you do this will depend on which heatmap package / function that you are using. Which heatmap package / function are you using?

ADD REPLY
0
Entering edit mode

Thanks, the meta.sig is a list contains 23 metabolism related pathways. Each element in the list is a vector of genes corresponding to the specific pathway. I am not asking the order because the order is fixed either in the heatmap or scoremap. I just wonder why the enrichment score would be totally blue but the corresponding gene expression is not generally downregulated, because you could almost 50% is upregulated and 50% is downregulated.

ADD REPLY
0
Entering edit mode

Why do you use method = 'gsva'and not method = "ssgsea"as you write? Also in the first heatmap you just show the gene expression? And then the signature you make are just of all genes?

ADD REPLY
0
Entering edit mode

Sorry the code is another test, the 'actual' score was calculated by ssgsea method. The heatmap showed the expression of all genes included in the first pathway, which show in the scoremap. I used specific gene list corresponding to a pathway to make a signature. In the meta.sig, there are 23 pathways inside it, so I would get 23 signatures but I just show one of them that confusing me a lot.

ADD REPLY
0
Entering edit mode

Ahh okay - did you turn off the ssgsea post-normalization (via the ssgsea.norm argument - on by default)? Else that could explain it :-)

ADD REPLY
0
Entering edit mode

No, this parameter was just like it was. OMG it is really confusing... Maybe I should just calculate the average value for each signature and at least that would be more interpretable.

ADD REPLY
0
Entering edit mode

I actually think it is a good idea as mostly when you test many signatures you are interested in comparing across those signatures :-)

ADD REPLY

Login before adding your answer.

Traffic: 3244 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6