Question: Orthomcl For More Than Two Species
2
gravatar for figo
7.1 years ago by
figo200
figo200 wrote:

Hi All

I have run the orthomcl for 4 species and having the orthologous pairs file and group file. I am confused that from these files how I can take out the 4 species orthologous sequences as orthologous pairs file have only the paired information and group files contain in one group many sequences.

If some one has run this please let me know.

regards

orthomcl • 2.3k views
ADD COMMENTlink modified 7.1 years ago by Manu Prestat3.9k • written 7.1 years ago by figo200
1
gravatar for sentausa
7.1 years ago by
sentausa640
France
sentausa640 wrote:

Each line in the group files lists the orthologous proteins among your species. If you have 4 species, there should be 4 proteins listed in each line. If you have more than 4 proteins listed in each line, some proteins might be orthologous to other protein(s) in the same species. If you have less than 4 proteins listed in each line, some species might not have particular protein(s) orthologous to those of the other species.

ADD COMMENTlink modified 7.1 years ago • written 7.1 years ago by sentausa640

Than you for your reply . One more thing I wanted to ask that all the proteins which are in same group are othologous to each other. Say I have 6 orthologous proteins in the group such that A1,A2,B1,B2,C1,D1. So can I make my grouping as A1,B1,C1,D1 and A2,B1,C1,D1, A1,B2,C1,D1 and A2,B2,C1,D1 as four orthologous groups ? If it is yes then do I have any script to do it ?

ADD REPLYlink modified 7.1 years ago • written 7.1 years ago by figo200

I'm sorry I don't get what you wanted to ask. One line in the group file is one orthologous group.

ADD REPLYlink written 7.1 years ago by sentausa640

Sorry I am not able to make you understand .I try once again , since the group file contains the orthologos group in one line and it can be many like you said more then 4 less then 4 and say its is 10,000 groups . But the unique member orthologos group containing only 4 speices is very less (only 1000). So if I just want to make an alignment of 4 species orthologs I just have 1000 genes list. How i can use the the other 9000 groups there i know that i cannot use the groups which are having 3 or 2 species.

ADD REPLYlink written 7.1 years ago by figo200

But what information do you want to obtain exactly? If you want to know the orthologs between all 4 species, then you can only use those 1000 groups (I believe they're called "core" proteome). If you use all 10,000 groups, I think it's what they call "pan"-proteome among the 4 species. I myself used Orthomcl to spot proteins which are specific or otherwise absent in a bacterial species, compared to other closely related bacteria.

ADD REPLYlink written 7.1 years ago by sentausa640
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1738 users visited in the last hour