Question

How To Edit Phylogenetic Trees As Per Required Output And What Are The Softwares Available To Do The Same

0

Entering edit mode

11.9 years ago

H@rry ▴ 30

Hi,

I am working in evolutionary biology of a family of proteins in plants, Human and Yeast and likewise have few query for the experts as follows

Is good idea to modify the phylogenetic tree as per our own requirements as some members are not clustered together from the same family. If Yes, please recommend me some free software for the same and If No, then what could be the possible explanation of such study for the publication. How to explain the fact that some gene families are not close to each other or overlapped with the different gene family. I tried some software but couldn't able to obtain it as per my wish.

Any suggestion will be appreciable

Thank you in advance

• 7.9k views

ADD COMMENT • link updated 11.9 years ago by DG 7.3k • written 11.9 years ago by H@rry ▴ 30

Ram · Answer 1 · 2013-09-02

There are numerous way to manipulate a phylogenetic tree to your liking -- both through programs (I like FigTree, but I have a co-worker who swears by TreeView and there are many others) and easily by editing your tree file in text format.

I'm actually concerned you would want to modify your phylogenetic tree -- to me this is akin to saying "I don't like how my data looks so I will modify the figures to show what I think it should look like."

If you think something looks suspect in your phylogenetic tree, I would first go evaluate the process of constructing your phylogenetic tree -- from sequence selection to alignment (it's very important to inspect by eye!) to the analysis process -- and inspect each step of your analysis. From my experience many people do not know how to properly construct a phylogenetic tree and I see many extremely poor examples in the literature. If your tree does not look the way you would "expect" your first step should be to assess your data analysis pipeline at every step of the analysis.

If after a series of quality control steps you find the same "surprising" clustering with branches, then you should attempt to come up with a reasonable explanation of why. Since this is your study gene and you know the most about it you will be the best candidate to explain why you are seeing the clustering you find in your robust phylogenetic analysis.

Ram · Answer 2 · 2013-09-02

Like Josh I would express some concern at what appears to be a desire to use a program to fix your tree. Single gene phylogenies will not always reflect the expected organismal phylogenetic relationships for any number of reasons. Some of them biological, and some due to potential methodological issues. When you start looking at gene families this is particularly true. Multiple duplications of some genes likely exist, which can confuse analysis if you are not careful with your selection of orthologs and paralogs.

Improper taxon sampling can create issues (missing or rogue taxa can both create problems, and for different reasons). Differential gene loss, gene replacement, laterla transfer... all of these can create legitimate biological confusions. Long branch attraction or rejection can cause sequences to cluster together when they shouldn't. Composition effects can cause the same thing through convergent evolution.

And, keep in mind that your underlying annotations may possibly be incorrect as well. I've seen it happen in many datasets.

My suggestions are to evaluate first your method of reconstructing the ohylogeny. For instance you should be using a full maximum-likelihood based method (or bayesian) instead of simple neighbor-joining methods. If you are using an ML or Bayesian method then evaluate the model you are using (LG versus WAG or JTT for instance with proteins). Using a program like ModelTest may take time but it will give you some insight if you are misspecifying your model.

Go through and check your ortholog/paralog selections carefully. Add taxa if you are undersampling diversity (this is usually one of my main criticisms of many papers). If some taxa look problematic do some tests to see why they are causing problems. If you do decide to remove any taxa or sequences you probably need to explain why in the publication and have a good justification for it and if you do remove sequences/taxa you need to actually redo the phylogeny as it may effect other branches and branch lengths.