Ka/Ks (Or Dn/Ds) Calculation From Sequenced Transcriptome Contigs
11.4 years ago
tom.newport ▴ 10

I have around 25,000 contigs for each of two species. I'm trying to calculate Ka and Ks (or dN and dS) for potential orthologs between species and potential paralogs within species. I have a list of contig pairs for which I want to calculate Ka and Ks, which looks like:

Species1_Contig12,Species1_Contig98,0
Species2_Contig16,Species1_Contig24,1

(The number signifies the comparison being made: 0 = species 1 to species 1, 1 = species 1 to species 2, 2 = species 2 to species 2.)
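For anyone wanting to work with a list like this programmatically, here is a minimal Python sketch of how the pairs could be read. The names `Comparison`, `KIND_LABELS` and `parse_comparisons` are made up for illustration, and it assumes exactly three comma-separated fields per line, with optional spaces around the commas.

```python
from typing import List, NamedTuple


class Comparison(NamedTuple):
    """One contig pair plus the comparison code (hypothetical helper type)."""
    contig_a: str
    contig_b: str
    kind: int


# Codes as described in the post: 0, 1 and 2.
KIND_LABELS = {
    0: "species1-species1",
    1: "species1-species2",
    2: "species2-species2",
}


def parse_comparisons(text: str) -> List[Comparison]:
    """Parse 'contigA,contigB,code' lines; blank lines are skipped."""
    pairs = []
    for line_no, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(",")]
        if len(fields) != 3:
            raise ValueError(
                f"line {line_no}: expected 3 comma-separated fields, "
                f"got {len(fields)}"
            )
        kind = int(fields[2])
        if kind not in KIND_LABELS:
            raise ValueError(f"line {line_no}: unknown comparison code {kind}")
        pairs.append(Comparison(fields[0], fields[1], kind))
    return pairs
```

Failing loudly on a line that does not split into three fields should catch accidental typos in the pair list before any downstream tool sees them.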

I also have a FASTA file with the consensus sequences for each contig. Names are the same in the FASTA file as in the comparison file, and those sequences have been annotated against other sequences by BLASTx, which suggests which reading frame I should be looking at.
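As a rough illustration of how the BLASTx frame could be used, the sketch below reads FASTA text and trims a contig to whole codons in a given frame. It assumes the usual BLASTx convention (frames 1 to 3 on the given strand, -1 to -3 on the reverse complement, with the offset being the absolute frame minus one); `parse_fasta` and `in_frame_codons` are hypothetical helper names, not part of any library. It does not trim to the aligned region or remove stop codons.

```python
from typing import Dict

_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def parse_fasta(text: str) -> Dict[str, str]:
    """Map each record name (header up to the first whitespace) to its sequence."""
    parts: Dict[str, list] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            if not header:
                raise ValueError("FASTA header with no name")
            name = header[0]
            parts[name] = []
        elif name is None:
            raise ValueError("sequence data found before the first header")
        else:
            parts[name].append(line)
    return {key: "".join(chunks) for key, chunks in parts.items()}


def reverse_complement(seq: str) -> str:
    """Reverse-complement a nucleotide string (A, C, G, T, N only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def in_frame_codons(seq: str, frame: int) -> str:
    """Return seq in the given frame, trimmed to a multiple of three bases."""
    if frame not in (-3, -2, -1, 1, 2, 3):
        raise ValueError(f"frame must be one of +/-1, 2, 3; got {frame}")
    if frame < 0:
        seq = reverse_complement(seq)
    offset = abs(frame) - 1
    usable = len(seq) - offset
    usable -= usable % 3  # drop any trailing partial codon
    return seq[offset:offset + usable]
```

For example, `in_frame_codons(contigs["Species1_Contig12"], 2)` would skip the first base and return the rest as whole codons, where `contigs` is the dictionary returned by `parse_fasta`.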

I would like to calculate Ka and Ks between the pairs in the file, but I have no idea how to do it - PAML has been suggested, but the data isn't in the right format.

Can anyone offer me any pointers on how to go about this?

paml • 5.3k views
11.3 years ago
Raygozak ★ 1.4k

You can look into the Los Alamos SNAP tool (http://www.hiv.lanl.gov/content/sequence/SNAP/SNAP.html), which calculates synonymous and non-synonymous substitution rates from codon-aligned nucleotide sequences. For the alignment, this might help you: http://www.expasy.org/genomics/sequence_alignment . You may also want to look into the codon suite to see if it helps you.

4.6 years ago
Adrian Pelin ★ 2.6k

If you can generate a translational (codon-aware) alignment for each of your suspected orthologous groups, you can use codeml, which is part of PAML.
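A translational alignment here means aligning the translated proteins and then threading the original codons back onto that protein alignment. A minimal Python sketch of that step follows, assuming you already have a gapped protein alignment (gaps as `-`) and the matching in-frame coding sequences. `thread_codons` and `to_paml_phylip` are hypothetical helper names. The output layout follows my understanding of PAML's sequential format (a count and length line, then each name separated from its sequence by at least two spaces), so please check it against the PAML manual. As far as I know, codeml also expects in-frame stop codons to be removed first, which this sketch does not do.

```python
from typing import Dict


def thread_codons(aligned_protein: str, cds: str) -> str:
    """Back-thread codons onto a gapped protein alignment row.

    Each residue consumes the next three bases of cds; each '-' in the
    protein row becomes '---' in the codon row.
    """
    residues = sum(1 for char in aligned_protein if char != "-")
    if len(cds) < 3 * residues:
        raise ValueError(
            f"CDS has {len(cds)} bases but {residues} residues need "
            f"{3 * residues}"
        )
    out = []
    pos = 0
    for char in aligned_protein:
        if char == "-":
            out.append("---")
        else:
            out.append(cds[pos:pos + 3])
            pos += 3
    return "".join(out)


def to_paml_phylip(records: Dict[str, str]) -> str:
    """Format equal-length aligned sequences as sequential PHYLIP text."""
    lengths = {len(seq) for seq in records.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")
    length = lengths.pop()
    lines = [f" {len(records)} {length}"]
    for name, seq in records.items():
        # Two spaces between name and sequence (assumed PAML requirement).
        lines.append(f"{name}  {seq}")
    return "\n".join(lines) + "\n"
```

Writing one such file per pair and pointing codeml at it in pairwise mode (`runmode = -2` in the control file, if I remember the manual correctly) should give dN and dS estimates for each pair.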
