Question: Low Support Value Of Phylogenetic Trees
0
gravatar for Dejian
9.3 years ago by
Dejian1.3k
United States
Dejian1.3k wrote:

The support value of my phylogenetic tree is very low. What are the possible reasons? How to improve the stability of the tree? (The peptide sequences were retrieved from the gene set of different insect genomes.)

phylogenetics statistics • 7.4k views
ADD COMMENTlink modified 9 months ago by Biostar ♦♦ 20 • written 9.3 years ago by Dejian1.3k
1

HI

I appreciate to know how to read and interpret-ate the phylogenetic tree for viruses?

What does the numbers for bootstraps mean and how interpreted?

Regards

Oumed

ADD REPLYlink modified 15 months ago by zx87549.9k • written 8.9 years ago by Oumed30

The bootstrap values signify how sure one can be that the tree is correct at any given node. A tree is calculated by looking at existing molecular data, and the more data one can gather, the easier it will be to relate the sequences in a highly-confident phylogeny. If the sequence evidence is not good enough, the bootstrap values will be low.

ADD REPLYlink written 8.9 years ago by Ahdf-Lell-Kocks1.6k
3
gravatar for Stefano Berri
9.3 years ago by
Stefano Berri4.2k
Cambridge, UK
Stefano Berri4.2k wrote:

Tell us more. How do you produce the tree?

In general, you should (1) get the protein sequences*, (1a) remove pseudogenes if you can, (2) align them (clustalX), (3) manually select conserved region(s), (4) iterate between points (2) and (3) till it is stable and then (5) bootstrap, (6) produce trees and (7) consensus.

I suspect you might have skipped step (3) and (4). If, after doing so, you still have low bootstrap values, I suspect you'll have some branches with good support but lower values in the "high branches". Focus on clusters of well conserved genes, remove them, cluster them alone and re-cluster the other with only one or two genes from each group.

Phyolgenetic treee are a bit of a craft...

  • as lh3 says in the comment below, you could also use DNA. DNA could improve only if the protein sequences are all VERY similar (like 95% or more identities).
ADD COMMENTlink modified 9.3 years ago • written 9.3 years ago by Stefano Berri4.2k

Using "the protein sequences" is not always the right thing to do. It depends. See also this question.

ADD REPLYlink modified 14 months ago by Ram32k • written 9.3 years ago by lh332k

There are programs automatically trimming poorly aligned regions. There was a question about this. I could not find it right now.

ADD REPLYlink written 9.3 years ago by lh332k

@lh3 trimal or gBlock can be used for trimming the alignment. I recommend trimal.

ADD REPLYlink modified 14 months ago by Ram32k • written 9.3 years ago by Leszek4.1k
1
gravatar for Leszek
9.3 years ago by
Leszek4.1k
IIMCB, Poland
Leszek4.1k wrote:

what is you gene of interest, or family of interest? if you are dealing with completely sequenced species, you can try to browse phylomeDB and Ensembl for Metazoa for you genes of interest. Both offer good quality of reconstructed alignments and trees.

ADD COMMENTlink written 9.3 years ago by Leszek4.1k
0
gravatar for Tancata
9.3 years ago by
Tancata210
Newcastle, UK
Tancata210 wrote:

Another question is how much data do you have (i.e. how many well-aligned positions?). If it's e.g. a single-gene alignment, it might not contain much information about the poorly-supported branches and you wouldn't have really been doing anything wrong. Poorly-aligned (or very variable) sequences, or sequences that are almost identical, might also not "be able" to resolve many of the branches on the tree. Or the sequences might be evolving in an unusual way.

ADD COMMENTlink written 9.3 years ago by Tancata210
0
gravatar for Cmzmasek
9.2 years ago by
Cmzmasek10
San Diego, CA, USA
Cmzmasek10 wrote:

Sometimes it helps to remove the most divergent sequences. Also the quality of the MSA is of uttermost importance, I recommend to use MAFFT or ProbCons to calculate it.

ADD COMMENTlink written 9.2 years ago by Cmzmasek10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2868 users visited in the last hour
_