Question: Known bacterial pathway - the easiest way to find and download 1:1 orthologs of each enzyme involved
18 months ago by natasha.sernova (3.5k) wrote:

Dear all, I am studying a well-known bacterial pathway. I've checked a couple of enzymes here and found more than a thousand orthologs, but I need only the orthologs in Gram-positive bacteria. I could do this manually, but I expect it would take a week. How can I do it computationally? OMA provides many file formats for its output, but unfortunately I know very little about HTML files and Python. Thank you very much for any help! Natasha

oma genome • 574 views
modified 18 months ago by adrian.altenhoff (620) • written 18 months ago by natasha.sernova (3.5k)

A quick look at the data files suggests that this will not be straightforward. You may want to write to the OMA folks to see if they can run a custom query against their database on the backend to generate the data you are looking for.

A query with "gram positive" brings up this. Perhaps you could use that to get the sequence.

modified 18 months ago • written 18 months ago by genomax (69k)

This is going to involve parsing the information from the available datasets. I suggest parsing either the "OMA groups" file (in txt or xml) or the "OMA Groups/Sequences in COGs format" file. However, I could not find a file listing the Gram-positive bacteria, which is the key piece. You may need to build that list from the NCBI taxonomy database.
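The flat-file route could look roughly like the sketch below. It is not OMA's own tooling, and it rests on two assumptions that should be checked against the downloaded file: that each line of the groups text file is tab-separated (group id, fingerprint, then member entry IDs), and that an entry ID starts with a 5-character species code. The helper names and the example lines are hypothetical.

```python
import re

# Assumption: an OMA entry ID is a 5-character species code followed by
# digits, e.g. "BACSU00001". Check this against the file you download.
OMA_ID = re.compile(r"^([A-Z0-9]{5})(\d+)$")


def species_code(entry_id):
    """Return the species-code prefix of an entry ID, or None if it does not match."""
    match = OMA_ID.match(entry_id.strip())
    return match.group(1) if match else None


def filter_groups(lines, wanted_codes):
    """Yield (group_id, members), keeping only members from the wanted species.

    Assumed line layout (tab-separated): group id, fingerprint, member IDs...
    Empty lines and lines starting with '#' are skipped.
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        group_id, members = fields[0], fields[2:]
        kept = [m for m in members if species_code(m) in wanted_codes]
        if kept:
            yield group_id, kept


# Hypothetical example lines; real input would be the opened groups file.
example_lines = [
    "# group\tfingerprint\tmembers",
    "1\tABCDEFG\tBACSU00001\tECOLI00002\tMYCTU00003",
    "2\tHIJKLMN\tECOLI00010\tSALTY00011",
]
wanted = {"BACSU", "MYCTU"}  # species codes of the clade of interest

for group_id, members in filter_groups(example_lines, wanted):
    print(group_id, members)
```

The set of wanted species codes still has to come from somewhere else, for example a list of Gram-positive species derived from the NCBI taxonomy.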

written 18 months ago by Joseph Hughes (2.7k)
18 months ago by adrian.altenhoff (620) wrote:

As @Joseph Hughes points out, you will have to either use the REST API to search for the orthologs of your query genes and limit them to the Gram-positive genomes (as far as I understand, these are essentially the Actinobacteria), or parse the flat files that contain all the orthologs and keep only those from your clade of interest. Both approaches require some scripting in your favorite language.

To get the set of Actinobacteria in OMA from the REST API you can use the following get-url:

Then you can limit the orthologs (either pairwise or HOGs) to species belonging to the set of Actinobacteria.
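The filtering step might be sketched as follows, working on JSON that has already been fetched and decoded. The field names are assumptions rather than a confirmed description of the API responses: a taxonomy node is assumed to be a nested dict with a "children" list and a species "code" on the leaves, and each ortholog record is assumed to carry a "species" dict with a "code". Both function names are hypothetical. Note also that Gram-positive bacteria are usually taken to include the Firmicutes as well as the Actinobacteria, so you may want to take the union of the codes from several clades.

```python
def leaf_codes(node):
    """Collect the species codes of all leaves below a taxonomy node.

    Assumption: inner nodes have a non-empty "children" list and leaves
    carry a "code" key. Adjust the keys to the JSON you actually receive.
    """
    children = node.get("children") or []
    if not children:
        code = node.get("code")
        return {code} if code else set()
    codes = set()
    for child in children:
        codes |= leaf_codes(child)
    return codes


def restrict_orthologs(orthologs, codes):
    """Keep only ortholog records whose species code is in the given set."""
    return [o for o in orthologs if o.get("species", {}).get("code") in codes]


# Hypothetical, hand-written data standing in for decoded API responses.
clade = {
    "name": "Actinobacteria",
    "children": [
        {"name": "species A", "code": "MYCTU"},
        {"name": "subclade", "children": [{"name": "species B", "code": "STRCO"}]},
    ],
}
orthologs = [
    {"omaid": "MYCTU00001", "species": {"code": "MYCTU"}},
    {"omaid": "ECOLI00002", "species": {"code": "ECOLI"}},
]

codes = leaf_codes(clade)
for record in restrict_orthologs(orthologs, codes):
    print(record["omaid"])
```

To cover more than one clade, call leaf_codes on each clade's tree and combine the resulting sets before filtering.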

written 18 months ago by adrian.altenhoff (620)


written 18 months ago by natasha.sernova (3.5k)