I'm struggling to understand application of branch, site and branch-site models to detect selection pressure across nucleotide coding sequences (CDS) of the set of genomes i'm working with. After reading of PAML manual and several codeml tutorials certain things are still not completely clear for me. Could you clarify these for me please?
Are the
site modelsonly used to detectselection pressurein certain genes of a genome? I.e. there are 6 alignments for different sets oforthologousgenes and i'd like to know which alignments (genes) are under selection pressure and which are not. Thebranchandbranch-sitemodelscould not be applied for that. Am i right?Why we need a phylogenetic tree even for application of the
site models? How tree topology will influence on the results of the analysis by means ofsite models?Are the
branchandbranch-sitemodelstypically applied for detecting differences inselection pressurebetween branches of the phylogenetic tree? If so, which the most important differences in usage betweenbranchandbranch-sitemodels? I mean the typical use cases for both model classes.Are the
clade model Candclade model Drepresent a two separate model classes or just a subclasses ofbranch-site model? What is the typical application for the clade model Candclade model D?
Thanks,
Denis