How to obtain statistical support for a certain hard clade when a set of phylogenetic trees is given?
1
0
Entering edit mode
5.1 years ago

I have a set of trees (e.g., MrBayes output *.run1.t, *.run2.t) with many taxa: A, B, C, .... Z.

I need to infer statistical support for a certain clade with the fixed topology ((E,F),G).

Sumtrees (Dendropy package) is a useful tool, but the only thing it can provide in my case is the cumulative number of trees which contain ((E,F),G), or (E,(F,G)), or (E,F,G).

Is it possible to get a number of trees with the exact clade ((E,F),G) excluding trees with (E,(F,G)) or (E,F,G)?

Thank you in advance.

clade topology • 746 views
ADD COMMENT
0
Entering edit mode
5.1 years ago
Joe 21k

You can follow the approach in this previous answer of mine: Detect trees (newick) with specific topology

If you just treat the tree as a string, search exactly for the number of trees containing the string ((E,F), G). You can convert to a cladogram, or use some regex magic to get around the branch lengths and bootstrap values within those clades.

ADD COMMENT
1
Entering edit mode

Dear Joe, this is indeed an easy and ingenious way, thank you.

ADD REPLY
1
Entering edit mode

If an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one if they work.
Upvote|Bookmark|Accept

ADD REPLY

Login before adding your answer.

Traffic: 1606 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6