Question: How to perform KEGG enrichment analyses on a set of genes
2
gravatar for miyakokodama
2.9 years ago by
miyakokodama20
miyakokodama20 wrote:

Possibly a silly question...

I work on a non-model plant species that has a sequenced genome. All genes from the map were aligned against KEGG proteins, therefore about ~70% genes on the map are annotated with K-term.

I performed differential expression analyses and obtained a set of interesting genes (230 genes), and I have an input that looks something like this:

Gene_1 hsa:9380 K00049

Gene_3 mdo:100015233 K00081

Gene_4 ath:AT1G30270 K00924

Gene_5 ath:AT1G30270 K00924

Gene_6 ath:AT3G59420 K00924

Gene_7 ath:AT3G59420 K00924

where I have a gene name for my species, locus tag ID (and the species in which the best hit was identified with), and its corresponding K-term.

Now I wanted to do pathway enrichment analyses and get a p-value for enriched pathways, but having to pick a species to do so has been hindering the process. Does anyone know how to deal with a situation like this? Any help would be greatly appreciated.

R rna-seq kegg enrichment pathway • 7.7k views
ADD COMMENTlink modified 14 months ago by bigmawen310 • written 2.9 years ago by miyakokodama20

I'm facing the same problem here, have you succeed afterwards? Can you share with me some experience?

ADD REPLYlink written 2.5 years ago by dukecomeback40

i am also facing the same proble, i got 10 different species and each species with several genes, could you tell me how to do the KEGG pathway enrichment analysis?

ADD REPLYlink modified 2.0 years ago • written 2.0 years ago by qiangzhulrs0
1
gravatar for dago
2.9 years ago by
dago2.5k
Germany
dago2.5k wrote:

There are many tools in Bioconductor, as clusterProfiler. Also, you can look into this post for more details

ADD COMMENTlink modified 2.9 years ago by Egon Willighagen5.2k • written 2.9 years ago by dago2.5k

Is there any alternate to kegg that is more recently updated?

ADD REPLYlink written 2.9 years ago by Parham1.4k

mmm...metaCyc maybe, but I am not aware of enrichment tool for it

ADD REPLYlink written 2.9 years ago by dago2.5k

With goseq you can use the reactome db (next to kegg and GO.db). But multiple enrichment only works with one organism, because you need to have the full genome. If you have your own gene2go or gene2pathway file, you can use that instead.

ADD REPLYlink modified 14 months ago • written 14 months ago by Benn7.7k

Thanks for your reply! I looked into clusterProfiler but it seems like enrichKEGG() only works when you specify a species, by having to set organism="". Do you agree?

ADD REPLYlink written 2.9 years ago by miyakokodama20
0
gravatar for EagleEye
2.9 years ago by
EagleEye6.4k
Sweden
EagleEye6.4k wrote:

Gene Set Clustering based on Functional annotation (GeneSCF)

Works for known species/ organisms.

ADD COMMENTlink modified 2.9 years ago • written 2.9 years ago by EagleEye6.4k

Thanks! Unfortunately my species is not on the list so it seems like I cannot use it, as I need to set the -org parameter. If you have any other suggestions, please feel free to let me know and thanks!

ADD REPLYlink written 2.9 years ago by miyakokodama20
0
gravatar for muhammadaqeel24
14 months ago by
muhammadaqeel240 wrote:

Use enricher() in clusterProfiler R package for this purpose. It shapes the things which are novel like yours.

ADD COMMENTlink written 14 months ago by muhammadaqeel240
0
gravatar for pablo61991
14 months ago by
pablo6199170
pablo6199170 wrote:

I'm also very interested in this topic, I didn't find a really good approach so please if you finally solve that please share the method.

Just if wanna try something which partially works for me: kobas.cbi.pku.edu.cn/help.php

The only problem is you need to repeat the annotation step and maybe you don't want it. For real data you usually need to install the tools in a local machine, instead of use the webtool.

ADD COMMENTlink written 14 months ago by pablo6199170
0
gravatar for bigmawen
14 months ago by
bigmawen310
United States
bigmawen310 wrote:

You can use gage package in R/Bioconductor:
http://bioconductor.org/packages/release/bioc/html/gage.html

The K-numbers are KEGG orthology gene IDs. you can generate gene set data for KO using kegg.gsets() function with species='ko'. BTW, KEGG data is constantly updated, so is the gene set data generated using kegg.gsets.

You may also use pathview web server for pathway analysis if you are more used to GUI program.
http://pathview.uncc.edu/

ADD COMMENTlink written 14 months ago by bigmawen310
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 964 users visited in the last hour