Phylogeny based on presence/absence of genes
3
0
Entering edit mode
6.3 years ago

Hello,

I have a binary matrix (0 and 1) corresponding at the presence/absence of genes and I want to perform a phylogenetic analysis. I know RAxML and PARS (phylip package) can deal with such data. However, I want to know if someone knows alternatives.

 

Thank you!

Antony

phylogeny phylip gene • 4.2k views
ADD COMMENT
3
Entering edit mode
6.3 years ago
dago ★ 2.7k

I guess that you should consider your data as a binary matrix (presence/absence) and perform a cluster analysis accordingly.

There are many tutorials on how to do that in R (here and here).

I guess that when you talk about phylogeny you talk about a more complex situation. You have a sequence alignment and according with the method you use you need to reconstruct the probable evolutionary processes that are responsible for the divergence of the resequences. This is done considering different model of evolution and the frequencies with which a change happens to a specific position in your sequences. Then you need to create a tree like representation of the relationship amongst your sequences and also this can be done using different methods not only based on similarity onr dissimilarity values amongst your sequences.

 

Here there are a bit more details:

How to perform phylogeny analyses

ADD COMMENT
1
Entering edit mode
6.3 years ago
Naren ▴ 900

Try using PAST this can form neighbor joining tree from multivariate analysis.

ADD COMMENT
0
Entering edit mode

Wow I did not that they made a version 3.0
 

ADD REPLY
0
Entering edit mode
6.3 years ago

Thank you for your answers. A clustering with PAST or R seems appropriate. However, which clustering index is the most appropriate knowing that my binary matrix corresponds at the presence/absence of genes in a set of genomes. I know it was discussed here: Similarity Measures Appropriate For Hierarchical Clustering On Gene Content/Binary Data

but I want to know your opinions.

 

 

 

 

ADD COMMENT
0
Entering edit mode

Hard to say in my opinion. It depends on how you want to treat your data. You can read more here and here
 

ADD REPLY

Login before adding your answer.

Traffic: 1306 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6