Question: Processing KAAS output (lists of KO numbers) into straightforward knowledge
gravatar for antortjim
4.6 years ago by
Spain/Seville/Universidad de Sevilla
antortjim10 wrote:

Do you know any resource or software (Bioconductor, Biopython, etc) that can be employed for summarising KAAS output?

While conducting a mock research following the Trinity-Transdecoder-KAAS pipeline based on the following paper. I have submitted the .pep file generated by Transdecoder to the Kegg  automatic annotation service (KAAS) and this has generated a list of KO (Kegg Orthology) numbers. Whereas I find it useful, I still don't see the point of such raw data. I would like to have it further summarised, for example, it would be great if I could get the number of pathways involved in each category (20 metabolism pathways, 30 genetics pathways and so on) or something like that. Really, what I would like is to perform a gene set analysis with this list of KO numbers and expect to see categorization. Correctly organized, this data could be plotted (pie plots for instance) in order to gain an insight into the kind of pathways and biological entities present in the sample.

I have made some research on biostars, and similar questions like this, this, this or this have been asked, but they either fail to answer to this question, or the proposed answer does not work anymore. I have also searched the Kegg website and I haven't found anything better than the KEGG Reconstruct Pathway, which renders a count of pathways within each subcategory that nevertheless has to be painstakingly parsed from html.

To sum up, I want to go from a huge KO numbers list to general statistics and counts. How can I get done? Thanks in advance.


kaas kegg • 2.6k views
ADD COMMENTlink modified 4.6 years ago by Nari880 • written 4.6 years ago by antortjim10

It's not KO, it's K0 K1 etc.

ADD REPLYlink written 4.6 years ago by Nari880
gravatar for Nari
4.6 years ago by
United States
Nari880 wrote:

Put those K numbers into the input box at bottom of the KEGG orthology search page . Try with less quantity first.

ADD COMMENTlink modified 4.6 years ago • written 4.6 years ago by Nari880

Indeed, it won't work in my computer if I input a list ok KEGG Orthology numbers bigger than 100. However I tried with less quantity and it did work, though none returned what I wanted. The one that came closest was the Map pathway option, which returns the name of every pathway along with the number of hits found on each. I would like to have the number of hits in pathways categories (i.e Metabolism, Genetic Information, Diseases, etc). Would that be possible? Thanks

ADD REPLYlink written 4.6 years ago by antortjim10

Yeah you can map those to catagories from this reference file

Available at KEGG Orhology website.

ADD REPLYlink modified 4.6 years ago • written 4.6 years ago by Nari880
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1383 users visited in the last hour