gene set enrichment analysis
4
1
Entering edit mode
21 months ago

hi all

i have one doubt regarding the gene set enrichment analysis method i have a list of upregulated genes and i have not done the differential gene expression analysis but instead i did rank prod analysis from which i obtained the list of upregulated genes so now when i m searching about the ways to analyse them i m not understanding as most of them are analysing the result from deseq2 and i m looking for the way to analyse my gene list so is there any tutorial or any material which works only on gene list

thanks

gsea pathwayanalysis • 1.9k views
ADD COMMENT
2
Entering edit mode
21 months ago
ATpoint 82k

So basically you have a list of genes and now want to know whether they're enriched for some terms or processed? I like g:profiler2 for these kinds of analysis. It has an R package if you want to automate the task.

ADD COMMENT
0
Entering edit mode

thanks for suggesting i tried to use that but the thing is that i have my gene list in form of csv the name is "upregulated.csv"and it has 720 genes so when i try to input my files and run the code it says no result to show. i tried to found out the format but they only mentioned the types of input not the format

so in what format i can input the data if you have any idea

thanks

ADD REPLY
0
Entering edit mode

Assuming the formatting of the genes was correct (please show head upregulated.csv) it can simply mean that there are no significant enrichments.

ADD REPLY
0
Entering edit mode

this is how the head upregulated.csv looks like

ADD REPLY
0
Entering edit mode

this is how the head(upregulated.csv) looks like

ADD REPLY
0
Entering edit mode

Yeah, if you indeed feed these IDs into the tool and it returns nothing then there is simply no statistical enrichment. Try relaxing the FDR threshold, maybe that helps. You can do that with the R package.

ADD REPLY
0
Entering edit mode

Okkk thanks I will try that as well and see which fits better

ADD REPLY
1
Entering edit mode
21 months ago
Marco Pannone ▴ 790

In your case, I would suggest performing an over-representation analysis (ORA) rather than GSEA. One quick way to perform ORA is on WebGestalt.

If you want to perform a proper GSEA, you need to have a full count matrix with normalized read counts per each gene as rows and all samples as columns. Then you can run GSEA, as described here (https://www.gsea-msigdb.org/gsea/index.jsp).

ADD COMMENT
0
Entering edit mode

Okk thanks for suggesting I will try this

ADD REPLY
1
Entering edit mode
21 months ago
Jeremy ▴ 910

In addition to ATpoint's suggestion of g:Profiler, you can try PANTHER, agriGO, and DAVID. Each of these tools takes a list of genes and returns gene set enrichment analysis results.

PANTHER

agriGO

DAVID

ADD COMMENT
1
Entering edit mode

Thanks i will take a look

ADD REPLY
0
Entering edit mode
21 months ago

hello all i tried to use gprofiler2 and i got the result i m sharing my way on what i did to input multiple queries so that it can be helpful to others

so as it is being said in the gprofiler tutorial that we have to create a named list of vectors so i did that its very basic code but it can be helpful to the people who are new to bioinformatics

first input the csv file df<-read.csv("upregulated genes.CSV")

select the column consisting genes or gene ids df1<-df$x

converting the selected column to vector vectordata<-as.vector(df1)

then creating the list of that vectors gene_list<-list(vectordata)

and now input the gene_list into gprofiler

hope it helps!

ADD COMMENT

Login before adding your answer.

Traffic: 1746 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6