Question: Problem with creating GSEA rank file
3.7 years ago, noushin.farnoud (United States) wrote:

Dear Biostars,

I want to prepare the rank file for a GSEA analysis based on RNA-seq results generated by DESeq2. I have found different recommendations on how to create the pre-ranked gene list. The GSEA site mentions that the gene list can be sorted by any value; however, other people have pointed out in this blog that the direction of the fold change is important for GSEA analysis. If the genes are sorted only by their log fold change, a gene with a large fold change but a poor p-value will be ranked higher than a gene with a smaller but statistically significant fold change.
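To make the concern concrete, here is a minimal sketch with invented numbers (the gene names, fold changes, and p-values are all hypothetical, not real DESeq2 output). It only shows what happens when the list is sorted by log2 fold change alone:

```python
# Hypothetical DESeq2-style results: (gene, log2FoldChange, pvalue).
# All values are invented purely for illustration.
results = [
    ("GENE_A", 4.0, 0.40),    # large fold change, poor p-value
    ("GENE_B", 1.2, 1e-8),    # modest fold change, highly significant
    ("GENE_C", -0.1, 0.90),   # essentially unchanged
    ("GENE_D", -2.5, 1e-5),   # significant down-regulation
]

# Rank by log2 fold change only, largest first; the p-value is ignored.
by_lfc = [gene for gene, lfc, p in sorted(results, key=lambda r: r[1], reverse=True)]

print(by_lfc)
```

In this toy list GENE_A lands above GENE_B even though its p-value is far weaker, which is exactly the behaviour I am worried about.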

I've also read Mark Ziemann's post about his approach to this issue, where he generates a new scoring metric by multiplying the sign of the fold change by the inverse of its p-value: http://genomespot.blogspot.com.au/2014/09/data-analysis-step-8-pathway-analysis.html
He also adds that "at the top of the list are the genes with 'strongest' up-regulation and the bottom of the list are genes with 'strongest' down-regulation and genes not changing are in the middle". I am not sure whether this is the right assumption for a GSEA input file.
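As far as I understand the metric, it could be sketched roughly as below. This is my own reading of "sign of fold change times inverse p-value", not necessarily the exact implementation from the post. The input values, the `signed_score` helper, and the `p_floor` guard against p-values of zero are all assumptions of mine; the output is written as a two-column, tab-separated gene/score list, which is how I understand the rank file layout.

```python
import csv
import io
import math

def signed_score(log2fc, pvalue, p_floor=1e-300):
    """Sign of the fold change multiplied by the inverse p-value."""
    # Assumption: floor the p-value so that p == 0 does not divide by zero.
    p = max(pvalue, p_floor)
    sign = math.copysign(1.0, log2fc) if log2fc != 0 else 0.0
    return sign / p

# Hypothetical (gene, log2FoldChange, pvalue) rows, invented for illustration.
results = [
    ("GENE_A", 4.0, 0.40),
    ("GENE_B", 1.2, 1e-8),
    ("GENE_C", -0.1, 0.90),
    ("GENE_D", -2.5, 1e-5),
]

# Score every gene, then sort from most up-regulated to most down-regulated.
ranked = sorted(
    ((gene, signed_score(lfc, p)) for gene, lfc, p in results),
    key=lambda row: row[1],
    reverse=True,
)

# Write the ranking as tab-separated text (in-memory here; a real run
# would write to a file ending in .rnk instead).
buf = io.StringIO()
csv.writer(buf, delimiter="\t", lineterminator="\n").writerows(ranked)
rnk_text = buf.getvalue()
print(rnk_text)
```

With these numbers the significant up-regulated gene comes first, the significant down-regulated gene comes last, and the genes with weak p-values sit in the middle, which matches the description quoted above.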

I would greatly appreciate it if you could help me understand this and explain your preferred method for creating a GSEA rank file from RNA-seq expression results.

Many Thanks,
Noushin

 

rna-seq gsea • 5.1k views

Hi Noushin,

As you note, there is no set ranking method for GSEA; there are many alternatives. Some people use fold change. In our group we use the signed p-value. It is not a new method, nor did I invent or "generate" it. We have published with it extensively, and no reviewer has ever questioned its validity. I've included a list of papers from the last 2 years that use this method. Now, what was your specific question?

Cheers,
Mark

http://www.ncbi.nlm.nih.gov/pubmed/27157830
http://www.ncbi.nlm.nih.gov/pubmed/27108813
http://www.ncbi.nlm.nih.gov/pubmed/25941940
http://www.ncbi.nlm.nih.gov/pubmed/25491312
http://www.ncbi.nlm.nih.gov/pubmed/24875254
http://www.ncbi.nlm.nih.gov/pubmed/24850797
http://www.ncbi.nlm.nih.gov/pubmed/24732587

written 3.0 years ago by mark.ziemann