Entering edit mode
                    7.5 years ago
        kamel
        
    
        ▴
    
    70
    Hi, I used proteinortho on 15 proteomes to extract the orthologue sequences a. I got a matrix prteinortho.txt. could you tell me how i can extract the orthologous sequences between the 15 species to build a phylogeny. Thank you in advance for your help.
# Species   Genes   Alg.-Conn.  Sample1.fasta    Sample2.fasta  Sample3.fasta   Sample4.fasta    Sample 5.fasta Sample6.fasta   Sample7.fasta    Sample8.fasta Sample9.fasta...... Sample 15
15 15   1   FKA_08408-RA     ZKA_09992-RA   FKC_10499-RA    FG005_03202-RA  FKN_11022-RA    FKR_00001-RA    FG851_09649-RA  FKL_12190-RA    FKT_08425-RA
Something is missing.
I forgot to cut the first column of matrix.
How do you define 'orthologous' ? only 1-to-1 ? also 1-to-many? many-to-many?
according to the manual: the matrix is based on the algebraic connectivity of the respective group. A connectivity of 1 indicates a perfect dense group with each gene similar to each other gene.
Hi h.mon I saw this note in the manual but I did not find an example of use for grap_proteins.pl (is it necessary for example that I give the matrix with the 15 proteomes???,)
Please use the
ADD COMMENTbutton when commenting on someones answer.Did you try to run the script without arguments? It should output some help:
You can also use orthofinder.py tool to get ortholog proteins in your data set. And you will be able to see how many single copy (one-to-one) orthologs are in your dataset. In phylogenetic analyses based on ortholog proteins mostly single copy orthlolog proteins are used.
I will try to use it and come back to you. thank you