hi, everyone I am working on calculate the substitution rate of noncoding rnas now. if i have a multiple sequence alignment from three species, the topology of tree is ((1,2),3), how can i use baseml and the REV substitution model to get my desire result? i have tried the example 'brown.nuc' and 'brown.trees' in paml4.7 package, but i cannot make out which number is subtitution rate of that sequences. i have read a lot of papers, but the formula is too difficult to understand, can you give me suggestion?any reply will be appreciated!
So, it's been a while since I did this type of analysis, but there are two levels of modeling:
1) Modeling substitution rates between two samples (to try and take the probability of multiple substitutions at the same site into consideration). This wikipedia page talks about the models for doing this:
2) Using an outgroup (sample 3 in your case) to try and estimate the ancestral state (or, perhaps more precisely, the rate of divergence from an ancestral state). In this particular paper (looking at substitutions at intergenic and synonymous sites), this was done with a relative rate test. See links below for paper and description of strategy:
From a practical standpoint, I don't remember what specific program outputs look like. However, I know MEGA is typically one of the most popular packages for evolutionary genomics analysis, and I think it has fairly comprehensive documentation:
If you can get a hold of this textbook, it might also be helpful to you: