Question: Run GSEA with custom gene sets?
0
gravatar for O.rka
27 days ago by
O.rka70
O.rka70 wrote:

Is there a way to use custom gene sets using GSEA? I downloaded GSEApy and would love to use it within Python but if this is not possible then I can definitely use the commandline version.

For example, what if I had the following data:

# Gene list
gene_list = [
"gene_A", 
"gene_B", 
...
"gene_xyz"
]

and then a bunch of gene sets like this:

gene_sets = {
"gene_set_1": ["gene_A", "gene_B", ...],
"gene_set_2":["gene_B", gene_C", ...], 
...
"gene_set_100":["gene_A", "gene_T", ...]
}

I also have expression data for all of this with multiple samples if that could be used as well.

Is there still a way to run GSEA(py?) to figure out which gene sets are enriched?

I usually deal with microbiome datasets with de-novo called ORFs from prodigal so there are no gene ids that would be useable. A lot of my friends in cancer labs always talk about GSEA but I don't have the types of IDs that they use. Though, I still have "gene sets" that I could use so I feel like it could still apply.

ADD COMMENTlink modified 27 days ago by h.mon21k • written 27 days ago by O.rka70
0
gravatar for h.mon
27 days ago by
h.mon21k
Brazil
h.mon21k wrote:

For the command-line, you have to prepare the gene-set in gmt format, and pass it with -g gene_sets.gmt. For the API, you have to use the gene_sets parameter from the gseapy.gsea() function.

ADD COMMENTlink written 27 days ago by h.mon21k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1361 users visited in the last hour