Question: Map Snp To Tree
5.6 years ago by
Amine0
Amine0 wrote:

Hi all, I need some suggestions on how to map the number of SNPs onto my tree. The goal is to start from a sequence alignment, build a phylogenetic tree, and then see the number of SNPs on each branch of this tree. Some people use Mesquite for that, but no one says how! It would also be nice to have the list of SNP positions for each branch. So if you know a way to do it, even with programming, please share your experience. Kind regards, Amine

tree snp mapping • 2.5k views
modified 5.6 years ago by Phil S.660 • written 5.6 years ago by Amine0
5.6 years ago by
Phil S.660
Stuttgart, Germany
Phil S.660 wrote:

Hi,

Maybe this is a bit late, but with a little programming you can use either of the following two approaches: 1. quartet mapping (paper), or 2. some kind of ANOVA (Wiki page).

The idea is to exploit the theory of splits in a phylogenetic tree. The edge you want to describe is your split: removing it splits not only the tree but also the underlying MSA, so you get two chunks of sequences that are already aligned. Now iterate over every position of the two chunks.

In the ANOVA case, calculate the edit distance within and between your two chunks of sequences at each position, and feed this into, e.g., Fisher's exact test to get the positions that are responsible for the edge. Positions with an E-value below your threshold are the SNPs responsible for the edge.

For quartet mapping you need to generate four different subtrees. From the split edge (the one for which you want to infer the important SNPs), create the left-top, left-bottom, right-top and right-bottom subtrees. Now try to incorporate the letter from the edge you are analyzing. To illustrate, you should create something like this:

[image: phy-tree-images]

The red edge is the edge you are investigating. If the distance from A/B to the letter of the red edge is shorter than the distance to C/D (or C/D is shorter than A/B), the site supports the edge; if not (which means the distances are equal), there is no support from this site for the edge. Hence you can run a Fisher's exact test over that as well, and you are good to go to create a subset. Careful: this only handles binary SNPs.
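Since the original code is not available, here is a minimal Python sketch of the split-based idea, not the original implementation. It makes several simplifying assumptions: the within/between edit-distance step is swapped for a plain 2x2 contingency table per alignment column (side of the split vs. major allele), the table goes into a hand-written two-sided Fisher's exact test, and the result is reported as a p-value. The names `fisher_exact_two_sided`, `snps_supporting_split`, `side_a` and `alpha` are made up for illustration; `side_a` is assumed to be the set of taxa on one side of the edge, taken from whatever tree library you use. Gap characters are treated like any other letter.

```python
from collections import Counter
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are at most as likely as the observed one.
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))


def snps_supporting_split(alignment, side_a, alpha=0.05):
    """Return [(column, p_value)] for biallelic columns associated with the split.

    alignment: dict mapping taxon name -> aligned sequence (all the same length)
    side_a:    set of taxa on one side of the edge; all others form the other side
    """
    taxa_a = [t for t in alignment if t in side_a]
    taxa_b = [t for t in alignment if t not in side_a]
    length = len(next(iter(alignment.values())))
    hits = []
    for pos in range(length):
        column = Counter(seq[pos] for seq in alignment.values())
        if len(column) != 2:
            continue  # skip invariant and multi-allelic columns (binary SNPs only)
        major = column.most_common(1)[0][0]
        a = sum(alignment[t][pos] == major for t in taxa_a)
        c = sum(alignment[t][pos] == major for t in taxa_b)
        p = fisher_exact_two_sided(a, len(taxa_a) - a, c, len(taxa_b) - c)
        if p <= alpha:
            hits.append((pos, p))
    return hits


# Toy alignment: column 2 separates the two sides, column 4 is a singleton.
alignment = {
    "t1": "ACGTC", "t2": "ACGTA", "t3": "ACGTA", "t4": "ACGTA",
    "t5": "ACTTA", "t6": "ACTTA", "t7": "ACTTA", "t8": "ACTTA",
}
hits = snps_supporting_split(alignment, side_a={"t1", "t2", "t3", "t4"})
print([pos for pos, _ in hits])  # 0-based column indices
```

Running this once per internal edge gives a list of SNP positions per branch, and the length of each list is the SNP count for that branch. With many columns and many edges you would probably also want some multiple-testing correction, which is left out here.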

I can't find the code at the moment, sorry about that. Don't hesitate to ask if this is still relevant for you.

modified 5.6 years ago • written 5.6 years ago by Phil S.660