Question: PAML needs branch labels in tree
gravatar for miles.thorburn
22 months ago by
miles.thorburn110 wrote:

I am trying to use PAML for the first time, and it's thrown up a few errors I am struggling to fix. The manual is particularly tricky to read, but I've solved most of the other problems I've come across. I am aiming to use it to calculate dN/dS for sets of genes. I used RAxML to generate my trees in Newick format, but the error message I get after executing codeml is this:

Error: need branch labels in the tree for the model..

My tree looks like this:


And my control file looks like this:

seqfile = /data/12_PAML/Input/Chr1/CA_R_ENSGACG00000022906_pruned.fa
outfile = /data/12_PAML/Output/Chr1/CA_R_ENSGACG00000022906_Codeml_Output.txt
treefile = /data/12_PAML/Input/Chr1/CA_R_ENSGACG00000022906_pruned.tre

noisy = 9
verbose = 1
runmode = 0

seqtype = 1
CodonFreq = 0
model = 2
NSsites = 2
icode = 0
fix_kappa = 0 
kappa = 1
fix_omega = 0
omega = 1
cleandata = 1

I've tried looking for similar answers and Ziheng has posted about similar problems saying to look at the examples folder. In it you get a trees like this:

((1,2) #1, ((3,4), 5), (6,7) ); 
((_10_H._fulgens, (((__4_H._kamtschatkana, (__1_H._rufescens,
(__2_H._sorenseni, __3_H._walallensis))) #1, (__5_H._sieboldii,
(__6_H._discus_hannai, __7_H._gigantea))), (__8_H._corrugata,
__9_H._cracherodii))), _25_H._iris, ((_17_H._pustulata,
(_24_H._t.coccinea, _23_H._t.tuberculata)), (_22_H._australis,
((_18_H._midae, ((_11_H._roei, (_12_H._scalaris, _13_H._laevigata)), 
(_14_H._cyclobates, (_15_H._rubra, _16_H._conicopora)))),
((_19_H._ovina, _21_H._varia), _20_H._diversicolor)))));

I'm struggling to see the difference. I've tried removing the floating point values and running it again, but to no avail. Any help would be very much appreciated. And thank you for your time.

newick tree paml • 1.1k views
ADD COMMENTlink modified 21 months ago • written 22 months ago by miles.thorburn110
gravatar for miles.thorburn
21 months ago by
miles.thorburn110 wrote:

Turns out this was a very simple answer. For using any of the PAML models with selection, you need to designate which branch(es) on your gene tree or phylogeny where selection differs. The notation is #1 or #n if there are n branches with different strengths of selection.

ADD COMMENTlink written 21 months ago by miles.thorburn110
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1245 users visited in the last hour