Question

Generate connected nodes as list of tuples by expanding newick tree

0

Entering edit mode

5.9 years ago

jacob_peters • 0

I am wondering if it is possible to get node connectivity from newick formatted tree. e.g (((t10,t6),(t9,((((t5,t1),t4),t3),(t7,t2)))),t8);. The example image is shown using a length of 1. Example tree image . How to I get a list of nodes that are connected. i.e relationship for each node. t5 and t1 are connected to t4, t4 connected to t3. so t5 and t1 are leaf nodes. Representing them in a table in the form looks like this

       t1   t2    t3    t4    t5   t6    t7    t8    t9      t10
t1                     (t1,t4)
t2                                                 (t2,t9)
t3                                                 (t3.t9)
t4              (t4,t3)
t5                     (t5,t4)
t6                                           (t6,t8)
t7                                                  (t7,t9)
t8
t9                                           (t9,t8)
t10                                          (t10,t8)

What I want is a list of tuples of the connected nodes e.g

list = [(t1,t4), (t2,t9), (t3.t9), (t4,t3), (t5,t4), (t6,t8), (t7,t9), (t9,t8), (t10,t8)]

My intuition of the connections maybe wrong but this is what I am trying to achieve. Is this possible? At end, I just want the leaf nodes to take on values of their parent nodes.

phylogeny trees python • 2.0k views

ADD COMMENT • link 5.9 years ago by jacob_peters • 0

0

Entering edit mode

Sorry about that. Like I said, I maybe wrong with the connections. I am just trying to see the possibility of getting something like a pair of edge list for parent child relationship. Similar to how pedigree is represented. I probably need more clarification on the newick format. I am actually looking at ETE right now

ADD REPLY • link 5.9 years ago by jacob_peters • 0

0

Entering edit mode

This shouldn't be posted as an answer as it disrupts the flow of the thread, but I'll put my comment here to keep it semi-ordered.

A newick tree basically is just a representation of nodes. There isnt a whole lot more to it than that. Optionally they can have branch lengths and bootstrap/node values.

So as I understand it you're really interested in representing a tree as a matrix?

In which case wouldn't it make more sense to have the matrix row/column intersections correspond to a number which is how many nodes apart X and Y are, as opposed to (tx, ty)?

e.g.

(A:1, (B:1, C:1)); is the simplest possible tree displayed in a basic Newick format. It tells you that firstly B and C are connected at a node, and then A is the ancestral node to that. It would look like:

   /-A
--|
  |   /-B
   \-|
      \-C

ADD REPLY • link 5.9 years ago by Joe 21k

0

Entering edit mode

This general idea might be vaguely useful: though its C code, so I have to tap out at this point...

https://www.geeksforgeeks.org/construct-ancestor-matrix-from-a-given-binary-tree/

ADD REPLY • link 5.9 years ago by Joe 21k

score 0 · Answer 1 · 2018-05-31

0

Entering edit mode

5.9 years ago

Joe 21k

What are you defining as connected nodes? (t10,t8) are 2 nodes apart in your bottom tree for example, but they're in your list.

If in doubt, the ETE Toolkit will almost certainly have workable tools for traversing the tree and figuring out what you want.

ADD COMMENT • link 5.9 years ago by Joe 21k