Orthomcl For More Than Two Species
1
2
Entering edit mode
11.3 years ago
figo ▴ 220

Hi All

I have run the orthomcl for 4 species and having the orthologous pairs file and group file. I am confused that from these files how I can take out the 4 species orthologous sequences as orthologous pairs file have only the paired information and group files contain in one group many sequences.

If some one has run this please let me know.

regards

orthomcl • 3.2k views
ADD COMMENT
1
Entering edit mode
11.3 years ago
sentausa ▴ 650

Each line in the group files lists the orthologous proteins among your species. If you have 4 species, there should be 4 proteins listed in each line. If you have more than 4 proteins listed in each line, some proteins might be orthologous to other protein(s) in the same species. If you have less than 4 proteins listed in each line, some species might not have particular protein(s) orthologous to those of the other species.

ADD COMMENT
0
Entering edit mode

Than you for your reply . One more thing I wanted to ask that all the proteins which are in same group are othologous to each other. Say I have 6 orthologous proteins in the group such that A1,A2,B1,B2,C1,D1. So can I make my grouping as A1,B1,C1,D1 and A2,B1,C1,D1, A1,B2,C1,D1 and A2,B2,C1,D1 as four orthologous groups ? If it is yes then do I have any script to do it ?

ADD REPLY
0
Entering edit mode

I'm sorry I don't get what you wanted to ask. One line in the group file is one orthologous group.

ADD REPLY
0
Entering edit mode

Sorry I am not able to make you understand .I try once again , since the group file contains the orthologos group in one line and it can be many like you said more then 4 less then 4 and say its is 10,000 groups . But the unique member orthologos group containing only 4 speices is very less (only 1000). So if I just want to make an alignment of 4 species orthologs I just have 1000 genes list. How i can use the the other 9000 groups there i know that i cannot use the groups which are having 3 or 2 species.

ADD REPLY
0
Entering edit mode

But what information do you want to obtain exactly? If you want to know the orthologs between all 4 species, then you can only use those 1000 groups (I believe they're called "core" proteome). If you use all 10,000 groups, I think it's what they call "pan"-proteome among the 4 species. I myself used Orthomcl to spot proteins which are specific or otherwise absent in a bacterial species, compared to other closely related bacteria.

ADD REPLY

Login before adding your answer.

Traffic: 2008 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6