Question: How can I identify all homologues between two fully assembled genomes?
gravatar for ChrisM
21 months ago by
ChrisM10 wrote:

I'm trying to identify the homologues of all genes between two fully assembled bacterial genomes (e.g. Bacillus subtilis and Staphylococcus aureus) in batch. It looks like relevant data are available on the NCBI COG database, but their 2012 COG software won't compile so I haven't been able to attempt this particular method. As an alternative to using this software, is there a flat file of gene name to COG (or equivalent) that I could just download then match in Excel? I greatly appreciate all suggestions. Thank you in advance!

Regarding the Cog Software: I downloaded the source code from and I'm using Ubuntu 16.04 xenial.


g++ -O2 -c main.cpp

In file included from main.cpp:2:0:

os.h: In function ‘void myOpenDir(const char*)’:

os.h:13:16: error: ‘chdir’ was not declared in this scope

chdir(dirpath); ^

Makefile:9: recipe for target 'main.o' failed

make: * [main.o] Error 1

cogs bacteria • 622 views
ADD COMMENTlink modified 21 months ago by genomax74k • written 21 months ago by ChrisM10

You may be able to use MBGD to get this information.

ADD REPLYlink written 21 months ago by genomax74k

Thank you! Also, after much searching, I also found: which has been fantastic.

ADD REPLYlink written 21 months ago by ChrisM10
gravatar for h.mon
21 months ago by
h.mon28k wrote:

Add #include <unistd.h> to os.h, this should fix COGsoft compilation issue.

ADD COMMENTlink written 21 months ago by h.mon28k

I had the same problem, this solution worked for me. Thanks

ADD REPLYlink written 17 months ago by serg0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1120 users visited in the last hour