Phylogenetic analysis and Substitution rate calculation
Entering edit mode
5.7 years ago
SJ Basu ▴ 40

Hello people,

I have to calculate KaKs ratio. Here is what I have: 1. Assembled transcripts (using trinity) 2. 100 genes sequences of a specific gene family 3. 80 protein sequences from the before mentioned genes.

Q1: How do I make the dataset for phylogenetic tree ? Should I mix the 1. (of course transcripts that mapped to sequences in 1. ) and 2. OR should I take only the sequences from 1. that mapped to sequences in 2.

Q2: How do I calculate KaKs using my transcript sequences(1.) and reference protein sequence(3.) ?

I have studied the PAML-PAL2NAL and MEGA5 pipeline and they perform multiple sequence alignment between same type of sequences(i.e. either mrna or proteins), which is where my case differs !!! Should I convert the selected transcripts from 1. to protein then perform msa ????

Any suggestion is highly valued and Thanks in advance

KaKs DnDs Substitution Phylogenetic • 1.9k views
Entering edit mode
5.7 years ago
Joseph Hughes ★ 3.0k

1) Convert all your transcripts to protein

2) Do a multiple sequence alignment of your protein sequences

3) Use PAL2NAL to convert your protein alignment to a codon alignment using the original transcripts

4) Use you codon alignment to produce a maximum likelihood tree

5) Use your codon alignment and your ML tree in PAML to obtain the KaKs ratio

Keep in mind that the better your sequence sampling, the more robust the results will be.

Entering edit mode

@Joseph Hughes..Thank you sir...clear instruction, very helpful !!!


Login before adding your answer.

Traffic: 1644 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6