Question: How Do I Quantify Divergence Between A Given Set Of Species?
7.4 years ago by
Eminencegrise210 wrote:

I have to construct a simple tree for my set of species showing putative times of divergence. What tool would you recommend?

phylogenetics • 7.7k views
ADD COMMENTlink modified 5.1 years ago by aidan-budd1.9k • written 7.4 years ago by Eminencegrise210

Can you give us more details? What type of data do you have?

ADD REPLYlink written 7.4 years ago by Khader Shameer17k

I have exactly 20 species of plants and I want to construct a distance matrix. This is a side part of my project, so I do not want to use sequence data myself; I would rather rely on an established database, as all I need is a rough idea. I would appreciate your reply. How do I create a distance matrix in a high-throughput manner rather than looking values up manually?

ADD REPLYlink written 7.4 years ago by Eminencegrise210
7.4 years ago by
David W4.6k
New Zealand
David W4.6k wrote:

Do you want to do it well or quickly?

I'm not being facetious: determining divergence times is difficult, and if you want to do it for publication you'll have to put some time and effort into doing it right. If, on the other hand, you want something for your own use or an assignment, you might get away with a quicker effort.

No matter what you do, you will need either an estimate of the rate at which your sequence evolves (in changes per site per million years) or some nodes in your tree that you can put a date on (fossils or a biogeographic split). If you want a rough idea of a date from that, then MEGA (mentioned in another answer) is as good as anything for getting dates.
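To make the two options concrete, here is a minimal sketch of the strict molecular clock arithmetic, assuming a single constant rate on both lineages. The function names and the numbers are illustrative only and are not taken from MEGA or any other tool.

```python
def divergence_time_mya(distance, rate_per_site_per_myr):
    """Estimate a divergence time under a strict molecular clock.

    distance: pairwise substitutions per site between two sequences.
    rate_per_site_per_myr: substitutions per site per million years,
        per lineage (assumed constant, which is a strong assumption).
    """
    if rate_per_site_per_myr <= 0:
        raise ValueError("rate must be positive")
    # Both lineages accumulate changes after the split, hence the factor 2.
    return distance / (2.0 * rate_per_site_per_myr)


def rate_from_calibration(distance, calibration_age_mya):
    """Back out a per-lineage rate from one dated node (fossil or split)."""
    if calibration_age_mya <= 0:
        raise ValueError("calibration age must be positive")
    return distance / (2.0 * calibration_age_mya)


if __name__ == "__main__":
    # Made-up values: a calibrated pair with distance 0.10 that split 50 Mya.
    rate = rate_from_calibration(0.10, 50.0)
    # Apply that rate to an undated pair with distance 0.04.
    print(divergence_time_mya(0.04, rate))
```

This gives a point estimate only; it ignores rate variation between lineages and the uncertainty in the calibration.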

If you need something a bit more serious, then I would look to BEAST. The advantage here is that you include a lot more of the uncertainty in the tree, the molecular clock and the calibration points when you do the dating (of course, that will almost certainly mean wider error bars, but that's often the price of being honest ;).

ADD COMMENTlink written 7.4 years ago by David W4.6k
7.4 years ago by
Haibao Tang2.9k
Richmond, CA
Haibao Tang2.9k wrote:

There is the TimeTree website, which contains authoritative estimates of divergence time (in millions of years) between pairs of species. Otherwise, you'll have to use some molecular markers, e.g. 16S rDNA sequences, and construct the phylogeny yourself.
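Once pairwise divergence times have been collected from such a resource, assembling them into a distance matrix is mechanical. The sketch below assumes the times are already in a Python dictionary; it does not query any website, and the species names and values are placeholders rather than real estimates.

```python
from itertools import combinations


def build_distance_matrix(species, pairwise_times):
    """Build a symmetric matrix from {(species_a, species_b): time_mya}.

    Raises ValueError if any pair of species has no entry, so that gaps
    in the lookup are noticed instead of silently becoming zeros.
    """
    index = {name: i for i, name in enumerate(species)}
    n = len(species)
    matrix = [[0.0] * n for _ in range(n)]
    seen = set()
    for (a, b), time_mya in pairwise_times.items():
        i, j = index[a], index[b]
        matrix[i][j] = matrix[j][i] = float(time_mya)
        seen.add(frozenset((a, b)))
    missing = [p for p in combinations(species, 2) if frozenset(p) not in seen]
    if missing:
        raise ValueError(f"no divergence time for pairs: {missing}")
    return matrix


if __name__ == "__main__":
    # Placeholder names and times, for illustration only.
    names = ["species_A", "species_B", "species_C"]
    times = {
        ("species_A", "species_B"): 10.0,
        ("species_A", "species_C"): 40.0,
        ("species_B", "species_C"): 40.0,
    }
    for name, row in zip(names, build_distance_matrix(names, times)):
        print(name, row)
```

For 20 species this needs 190 pairwise values, which is why a scripted lookup or a bulk export is preferable to manual entry.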

ADD COMMENTlink written 7.4 years ago by Haibao Tang2.9k


I wanted to get a rough estimate of divergence times for my set of sequences. I used TimeTree to get rough estimates of the times. Since MEGA6 comes with RelTime for estimating divergence times, I thought that might be more reliable, but I don't know how to use it. It is asking for minimum and maximum divergence times, which I don't know. Can anyone help me with it? How do I proceed?

ADD REPLYlink written 2.5 years ago by sansritisinha0
7.4 years ago by
Portland, Oregon
Sashi Kiran Challa300 wrote:

Please take a look here for building trees using MEGA.

ADD COMMENTlink written 7.4 years ago by Sashi Kiran Challa300

The distance matrix created by MEGA is in CSV format. How do I get a .dist file to use with MOTHUR?
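One possible approach to the conversion is a short script that rewrites the CSV as a PHYLIP-style square matrix, a format mothur is generally documented as reading. This is a sketch under assumptions: the CSV layout (one header row, taxon name in the first column, possibly only the lower triangle filled) has not been checked against MEGA's actual export, and the function name is hypothetical.

```python
import csv
import io


def csv_to_phylip_dist(csv_text):
    """Convert a CSV distance matrix to PHYLIP-style square text.

    Assumed layout (not verified against MEGA's export): a header row,
    then one row per taxon with the name in the first column. Empty
    cells, e.g. an unfilled upper triangle, are taken from the
    mirrored cell; an empty diagonal is written as zero.
    """
    rows = [r for r in csv.reader(io.StringIO(csv_text))
            if any(cell.strip() for cell in r)]
    body = rows[1:]  # skip the header row
    names = [r[0].strip().replace(" ", "_") for r in body]
    n = len(names)
    dist = [[None] * n for _ in range(n)]
    for i, row in enumerate(body):
        for j, cell in enumerate(row[1:n + 1]):
            if cell.strip():
                dist[i][j] = float(cell)
    lines = [str(n)]  # PHYLIP files start with the number of taxa
    for i in range(n):
        values = []
        for j in range(n):
            d = dist[i][j] if dist[i][j] is not None else dist[j][i]
            if d is None:
                if i != j:
                    raise ValueError(f"no distance for {names[i]}, {names[j]}")
                d = 0.0
            values.append(f"{d:.6f}")
        lines.append(names[i] + "\t" + "\t".join(values))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    example = "name,A,B,C\nA,,,\nB,0.1,,\nC,0.2,0.3,\n"
    print(csv_to_phylip_dist(example), end="")
```

To produce a file, write the returned string to a path of your choice, and check the result against the mothur documentation (for example any limits on name length) before relying on it.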

ADD REPLYlink written 19 months ago by ag1805x40
7.4 years ago by
Manhattan, NY
Khader Shameer17k wrote:

There are already nice answers; here are my thoughts:

Following David's suggestions, I would suggest that you put in more thought before attempting to quantify the divergence between a given set of species. Getting evolutionary distance on a time scale with the tools discussed may provide an abstract view of the diversity pattern based on the data you have. If you are new to building phylogenetic trees, I strongly recommend a quick read of this article.

In a nutshell: from a bioinformatics perspective, if you are looking at a set of plant species to get a "pattern of divergence" based on a distance matrix, you need to define a dataset (for example protein, DNA or RNA sequences) and then follow a typical phylogeny analysis pipeline.

Select a reference sequence from each species > BLAST > Alignment > Phylogeny analysis

For phylogeny analysis you may either use the tools discussed here or go with the classic tool Phylip. A typical workflow for protein sequences using Phylip, which generates a distance matrix that can be visualised using phylogeny visualization tools, is as follows.

alignment -> seqboot -> protdist -> neighbor -> consense

This will give you an abstract view of divergence among the plant species based on the sequences.
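To show what the distance step in such a workflow produces, here is a minimal sketch that computes an uncorrected p-distance matrix from already aligned sequences. It is deliberately simpler than what protdist does (protdist applies substitution models), and the function names and toy sequences are illustrative only.

```python
def p_distance(seq_a, seq_b, gap_chars="-?"):
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap or unknown character are
    skipped. No correction for multiple substitutions is applied.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x in gap_chars or y in gap_chars:
            continue
        compared += 1
        if x != y:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


def distance_matrix(alignment):
    """Return (names, matrix) for a {name: aligned_sequence} mapping."""
    names = list(alignment)
    matrix = [[p_distance(alignment[a], alignment[b]) for b in names]
              for a in names]
    return names, matrix


if __name__ == "__main__":
    # Toy alignment, not real data.
    toy = {"sp1": "MKTAYIAK", "sp2": "MKTAYLAK", "sp3": "MRTA-LAK"}
    names, matrix = distance_matrix(toy)
    for name, row in zip(names, matrix):
        print(name, [round(d, 3) for d in row])
```

A matrix like this is the kind of input a neighbor-joining step consumes; for real analyses a model-corrected distance is usually preferable to the raw proportion.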

If you just need to get an idea of where your species of interest sit in the tree of life, without sequence-based data, you may use TimeTree as suggested by Haibao.

ADD COMMENTlink modified 7.4 years ago • written 7.4 years ago by Khader Shameer17k
7.4 years ago by
Boston, MA USA
Larry_Parnell16k wrote:

To Khader's answer, I would add that the dataset to examine can also be the entire genome, specifically the organization of the genome. In similar analyses we have done (with plant genome data), we would ask a question along the lines of: how many pieces of contemporary genome X need to be rearranged to compose contemporary genome Y? Certainly, by far the most popular way to measure the "distance" between organisms is in millions of years since they shared a common ancestor. Given that plant genomes have seen tremendous change over time, especially in terms of genome rearrangements and genome duplication/loss, one might very well consider measuring distance between species in terms of large-scale genome changes. For example, one reason wheat is so different is that it is hexaploid. Z. mays saw genome duplication events as well.

In other words, don't lose sight of the bigger picture, often a biological picture, when embarking on the analysis.
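As a rough illustration of the rearrangement question, the sketch below counts breakpoints between two gene orders. This is a simple proxy chosen for this example, not the analysis described in the answer and not the minimum number of rearrangements; it assumes both genomes contain the same genes exactly once, ignores strand, and therefore cannot represent duplications or polyploidy.

```python
def breakpoint_count(order_x, order_y):
    """Count adjacencies in order_x that are not adjacencies in order_y.

    Both arguments are lists of the same gene (or block) identifiers,
    each present exactly once. Orientation is ignored.
    """
    if sorted(order_x) != sorted(order_y) or len(set(order_y)) != len(order_y):
        raise ValueError("both orders must contain the same unique genes")
    # Relabel genes by their position in order_y, so order_y becomes 1..n.
    position = {gene: i + 1 for i, gene in enumerate(order_y)}
    n = len(order_y)
    # Frame with 0 and n + 1 so changes at the chromosome ends count too.
    relabelled = [0] + [position[g] for g in order_x] + [n + 1]
    return sum(1 for a, b in zip(relabelled, relabelled[1:])
               if abs(a - b) != 1)


if __name__ == "__main__":
    # Hypothetical gene blocks, for illustration only.
    genome_y = ["g1", "g2", "g3", "g4"]
    genome_x = ["g1", "g3", "g2", "g4"]
    print(breakpoint_count(genome_x, genome_y))
```

Real comparisons of plant genomes start from syntenic blocks rather than single genes, and need methods that handle duplication and loss.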

ADD COMMENTlink written 7.4 years ago by Larry_Parnell16k
5.1 years ago by
aidan-budd1.9k wrote:

Jeff Thorne has stopped focusing on this kind of thing in his work, but has some papers that you might like to look through for help understanding some of the issues involved - he has a link to his publications from his webpage:

Jeff really knows what he's talking about, so he's a good place to go for an authoritative take on this topic.

ADD COMMENTlink written 5.1 years ago by aidan-budd1.9k