Question: Protein Phylogenetic Analysis
gravatar for User 0063
8.4 years ago by
User 0063210
User 0063210 wrote:

I've to infer a phylogenetic analysis on protein sequences. I've seen that some sequences are "conceptual translation". Do you suggest me to retrieve and use the genetic sequences rather than the proteic ones?

best regards

ADD COMMENTlink modified 3.8 years ago by Biostar ♦♦ 20 • written 8.4 years ago by User 0063210
gravatar for Dave Lunt
8.4 years ago by
Dave Lunt2.0k
Hull, UK
Dave Lunt2.0k wrote:

It doesn't matter where the sequence originated, if you have to do a protein phylogeny then do it with amino acid sequences. If, when you have done that, there isn't enough variability to resolve the proteins in the way you want (ie they are too similar) then you could repeat the process with nucleotide sequences as you will get much more variation.

In truth you might find that almost all the amino acid sequences you are dealing with are "conceptual translations" of DNA sequences. Very few sequences originate from protein sequencing, and lots come from DNA sequences that have been translated. This doesn't matter for your purposes.

ADD COMMENTlink written 8.4 years ago by Dave Lunt2.0k
gravatar for Rm
8.4 years ago by
Danville, PA
Rm7.8k wrote:

Phylogenetic analysis using protein (20 alphabets) sequences gives better resolution than using corresponding gene sequence (DNA: 4 alphabets).

ADD COMMENTlink written 8.4 years ago by Rm7.8k

This isn't really right RaghuM. Amino acid sequence is better for some questions, worse for others. Taxa or genes that are very divergent may have too much homoplasy in the nucleotide sequence. Alternatively closely related stuff too little information in a.acids. It is very unlikely that many positions in a protein are free to vary between 20 amino acids.

ADD REPLYlink written 8.4 years ago by Dave Lunt2.0k

Choosing the right algorithm, shouldn't DNA sequences give better results because they contain more information (4^3 bits at one aa position)? Also, parameters like omega = dN/dS can only be calculated when using DNA sequences.

ADD REPLYlink written 8.4 years ago by Michael Schubert6.9k

which one to prefer: Its a context dependent whether DNA or Protein, I do agree for ka/ks and other calculations DNA will be better provided sequences are closely related. But for distantly related one Protein analysis will be better. but any how question was "protein phylogeny"

ADD REPLYlink written 8.4 years ago by Rm7.8k
gravatar for Dataminer
8.1 years ago by
Dataminer2.6k wrote:

This article (lin k text) may answer your doubt. Without doubt protein sequences have less signal to noise ratio as compared to DNA sequences and furthermore protein alignments also benefits from substitution matrices and moreover Phylogenetic links disappear more rapidly between DNA sequences (discrepancy) than between protein sequences.

Details about the best approach to build a phylogenetic tree and the advantages and disadvantages of using DNA and protein sequences can be read from, " Bioinformatics and functional genomics" Jonathan Pevsner.

Wish u luck :)

ADD COMMENTlink written 8.1 years ago by Dataminer2.6k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1931 users visited in the last hour