Which tools to group orthologous proteins?
1
0
Entering edit mode
6.6 years ago
biolab ★ 1.3k

Hi everyone

I have many protein sequences from new species.  The data for these species is more complete than those collected in Genbank db.   I would like to group orthologous proteins into clusters.  Which tools can be applied for this work?  I very appreciate your advices. Thank you greatly!

orthologous • 1.6k views
ADD COMMENT
1
Entering edit mode

This may not be what you're looking for, but I like to do rough sequence clustering on my protein sets using CD-hit. http://weizhongli-lab.org/cd-hit/. It's very fast and can give you an idea of your protein demographics. This is what UniProt uses to cluster. 

ADD REPLY
0
Entering edit mode

Thank you for your comment that is very helpful.

ADD REPLY
2
Entering edit mode
6.6 years ago
dago ★ 2.7k

Look here, maybe it helps:

A: Detecting unique genes or non-homologus genes in a species

 

ADD COMMENT
1
Entering edit mode

BRBH is a great way to establish orthologous genes, but this process can become computationally difficult if you are working with eukaryotic species.

OP, you should still take the currently available annotations into consideration. They can serve as sanity checks and you may be able to correct problems in whatever is currently available.

ADD REPLY
0
Entering edit mode

Thanks, joe.cornish826. Your advices is helpful!

ADD REPLY
0
Entering edit mode

Thanks a lot, dago.  I want to ask you one more question.  The methods in OrthoMCL and  InParanoid are a little bit complicated (OMCL needs mysql and other tools).    I want to use reciprocal best-hit BLAST approach, the simplest method.   But after BLAST, is there any perl script to group orthologous proteins?  THANKS!

ADD REPLY
0
Entering edit mode

Hi, dago, I downloaded get_homoilogous. Let me try it. THANKS!

ADD REPLY

Login before adding your answer.

Traffic: 1783 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6