Question: Kozak sequence strength calculation
gravatar for Alexandros.Frydas
8 months ago by
Alexandros.Frydas20 wrote:

Hello everyone,

I want to assess the probability of a translation initiation site to be actually translated (Interesting in human genome). Based on literature I know that one of the most important requirements is a good Kozak sequence. Is anyone aware of a bioinformatics tool that could calculate Kozak concensus strength based on a given sequence?

Thanks in advance!


ADD COMMENTlink modified 16 days ago by Biostar ♦♦ 20 • written 8 months ago by Alexandros.Frydas20

I won't post this as an answer since it doesn't answer the question as posed, but is perhaps some food for thought:

To the best of my knowledge, recent studies suggest that pretty much the whole chromosome is transcribed to some degree at any given time, but the levels are obviously modulated. If Shine-Dalgarno sequences in bacteria (closer to my area of expertise) are any indication, there is a relationship between sequence 'identity' and transcriptional/translational activity - however it's very complicated, as the sequence of the regulatory sequences is not the be-all and end-all.

There may well be existing literature which has benchmarked the transcriptional activity of different sequences, but the problem is that the data will be essentially incomparable between different experiments due to batch effects.

A quick google for kozak sequence effects on transcription turns up articles such as:


I'm certainly not aware of any tools that do this already. In principle I can see that it would work as a predicable phenomenon. Something to the effect of creating a standard curve of sequence distance from the canonical sequence (perhaps), versus transcriptional activity, but it would need to be based on a carefully curated reference set of data. I'm sort of thinking along the lines of a transcription initiation equivalent of the "N-end rule", or a similar kind of benchmarking to the study which showed translational response to different start codons (spoiler alert, ATG doesn't mean sh*t!)

The only thing that springs to mind otherwise, would be the Softberry site, which has several tools for promotor sequence analysis etc (though this usually only means predicting their locations in sequences etc), but maybe theres something there you can make work:

ADD REPLYlink modified 8 months ago • written 8 months ago by jrj.healey12k

Hello Mr. Healey,

The two articles you referred me to are indeed some good extra knowledge and I have already gone through both.

Thanks a lot for sharing your thoughts regarding this issue. I will also have a look on the site you mentioned because I was not aware of it ( and whatever has to do with bioinformatic tools in general I am a newbie in this field)

Thanks a lot !

ADD REPLYlink modified 8 months ago • written 8 months ago by Alexandros.Frydas20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 971 users visited in the last hour