Question

GSEA Analysis: rank genes by fold change, t-statistic, U-statistic or KS-statistic?

0

Entering edit mode

7.8 years ago

Juan Cordero ▴ 140

Dear community,

in order to find significant enrichment in certain gene sets for my microarray data (2 conditions), I have tried out with different methods to calculate the gene scores, yielding different results in the number of enriched gene sets. In this way, the methods with the decreasing number of enriched gene sets (q-value < 0.05) are, for KEGG pathways: KS (81) > U-statistic (32) > FC (14) > t-statistic (0). The U-statistic 32 gene sets include the FC 14 gene sets, and the KS-statistics 81 gene sets include in turn the U-statistics ones. The intersection of KS, U and FC results equals FC. It might be possible KS-statistics yields many false positives.

I would be very very grateful if somebody could give me a piece of advice on this, since I do not know which method should be the most appropiate.

I am using gage package in R by the way.

Thanks

microarrays gsea ranking • 3.7k views

ADD COMMENT • link updated 7.8 years ago by Biostar 20 • written 7.8 years ago by Juan Cordero ▴ 140

2

Entering edit mode

Different scores do different things. Try to understand how they relate to the question you're interested in e.g. if you're most interested in effect size then the fold change is what you should use but if you're more interested in statistical significance then look for one of the statistics taking into consideration the assumptions they make e.g. t-test assumes normally distributed data whereas KS doesn't.

ADD REPLY • link 7.8 years ago by Jean-Karim Heriche 27k