Question: Bacterial known pathway - the easiest way to find and download 1:1 orthologs to each ferment involved
gravatar for natasha.sernova
2.1 years ago by
natasha.sernova3.7k wrote:

Dear all, I am studying a well known bacterial pathway. I've checked a couple of enzymes here ( - I've found more than a thousand orthologs, but I need just orthologs in gram(+) bacteria. It's possible to do it manually, but I expect to spend a week on it. How can I do it computationally? OMA provides a lot of file-formats for its output. But I am pretty ignorant in HTML-files and Python, unfortunately. Thank you very much for any help! Natasha

oma genome • 663 views
ADD COMMENTlink modified 2.1 years ago by adrian.altenhoff630 • written 2.1 years ago by natasha.sernova3.7k

A quick look at the data files seems to indicate that this would not be a straightforward thing. You may want to write to OMA folks to see if they have a way to custom query their database on the backend to generate the data you are looking for.

A query with "gram positive" brings up this. Perhaps you could use that to get the sequence.

ADD REPLYlink modified 2.1 years ago • written 2.1 years ago by genomax78k

This is going to involve parsing the information from the available datasets. I suggest either parsing the "OMA groups" file in txt or xml or the "OMA Groups/Sequences in COGs format". However I could not find the important file listing the Gram positive bacteria. You may need to do this by using the NCBI taxonomy database.

ADD REPLYlink written 2.1 years ago by Joseph Hughes2.8k
gravatar for adrian.altenhoff
2.1 years ago by
adrian.altenhoff630 wrote:

As @Joseph Hughes points out, you will have to either use the REST API to search for the orthologs of your query genes and limit them to the gram-positive genomes (as far as my understanding goes these are essentially the Actinobacteria), or you parse the flat files that contain all the orthologs and filter the ones from your clade of interest. Both approaches require some scripting in your favorite language.

To get the set of Actinobacteria in OMA from the REST API you can use the following get-url:

Then, you can limit the orthologs (either pairwise or HOGs) to species belonging to set of Actinobacteria.

ADD COMMENTlink written 2.1 years ago by adrian.altenhoff630


ADD REPLYlink written 2.1 years ago by natasha.sernova3.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1100 users visited in the last hour