Question: What is the best way to rank genes for GSEA?
gravatar for Gabriel
18 months ago by
Gabriel60 wrote:

I am doing pathway and gene ontology analysis using Gene Set Enrichment Analysis(GSEA). For the tools, you need to provide a ranked gene list, however, various papers have provide different recommendations on how to do this.

Is there a current consensus on what is the ideal way to do this? I've been using Log2 Fold change, and I am unsure weather to use Fold Change, p-values instead. Or an other method?

One post: Problem with creating GSEA rank file recommended signed p-values, but I haven't found any literature reviews or clarification on the issue. clusterProfiler mentions fold change for ranked gene lists, so I am unsure if I would be getting "bad results" by using p-value sorting. And if the different packages are optimized for one or the other sorting.

According to Yu, author of cluster profiler:

geneList contains three features: numeric vector: fold change or other type of numerical variable named vector: every number has a name, the corresponding gene ID sorted vector: number should be sorted in decreasing order

"other type of numerical variable" is unclear. Perhaps there are other, similar methods to GSEA who have a more concrete way of doing things?

EDIT: for clusterProfiler's function gseGO() I get different result when using Log2FoldChange versus FoldChange for ranking

rna-seq go gsea gene ranking • 3.9k views
ADD COMMENTlink modified 18 months ago by Pietro100 • written 18 months ago by Gabriel60
gravatar for Pietro
18 months ago by
Pietro100 wrote:

Hi Gabriel

For GSEA, some they do signed fold change * -log10pvalue, found it here:

ADD COMMENTlink modified 18 months ago • written 18 months ago by Pietro100
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1215 users visited in the last hour