Question

Multiple START codons in DNA/RNA Translation

0

Entering edit mode

4.0 years ago

rebelCoder ▴ 20

Hello!

If we apply a basic algorithm to a reading frame to scan it and look for START and STOP codons (to assemble a possible protein), we get cases, when we have multiple START codons and one STOP codon:

Example:

Reading Frame: ['A', 'S', 'M', 'A', 'P', 'M', 'Q', 'P', 'I', 'T', 'P', 'S', 'A', 'T', '_', 'T']

We see that we have two START codons (M) and one STOP (_) codons and if we generate a possible protein from this reading frame, we will get two results:

1) MAPMQPITPSAT

2) MQPITPSAT

First one contains the second one in it and also has a START codon in it.

Question (from a programmer): Is there ever a need to generate sequences like that? Is it useful for some kind f statistics, or do we discard the first one and only use sequences that have one START and one STOP codon?

Regards. Juris.

Translation RNA DNA Codons • 6.9k views

ADD COMMENT • link updated 3.9 years ago by Biostar 20 • written 4.0 years ago by rebelCoder ▴ 20

0

Entering edit mode

I don't understand your problem. Many (most ?) proteins have more than one Methionin.

ADD REPLY • link 4.0 years ago by Pierre Lindenbaum 161k

0

Entering edit mode

Oh. It is not a problem at all. I understand the process and I have my code working just fine. All I am asking, is, are those, multiple START codon sequences useful and do we have any real proteins that have double or more START codons in them? It does not look like there are? So it is not as problem I have, but a question I ask about application and usefulness of those sequences. Some sequences I work on, generate 100s of redundant amino acid chains like that that have multiple START codons.

ADD REPLY • link 4.0 years ago by rebelCoder ▴ 20

0

Entering edit mode

Within reason (and supported by experimental evidence) proteins can have alternate start sites but every START codon you see is not going to code for a real protein.

ADD REPLY • link 4.0 years ago by GenoMax 141k

0

Entering edit mode

I think I found a good answer here:

It's common to have multiple ATG codons in an mRNA sequence. Generally, the first ATG serves as protein translation starting site and is considered as a start codon if that ATG is at the beginning of a full and functional open reading frame. This is because the specific motifs and structures of the mRNA sequence has to be integrated correctly into a ribosome, a protein translation machinery, to initiate the translation process. In some cases, two ATG codons are closely located in the 5' end of mRNA, one might generate a truncated protein with few amino acid residues only, but another can result in a functional protein. In this case, the second one can be considered as start codon for that functional protein sequence. This phenomenon is considered as a translational regulation of a functional gene.

https://www.researchgate.net/post/im_working_on_a_gene_which_sequence_have_two_ATG_so_i_get_confused_that_which_ATG_is_start_codon

ADD REPLY • link 4.0 years ago by rebelCoder ▴ 20

1

Entering edit mode

Generally, the first ATG serves as protein translation starting site and is considered as a start codon if that ATG is at the beginning of a full and functional open reading frame.

ADD REPLY • link 4.0 years ago by Pierre Lindenbaum 161k

0

Entering edit mode

You would likely need to look for additional clues, such as proximity to promoters and ribosome binding sites to determine which ATG is the 'right one'.

Biology is messy though, and its quite possible that the forms that appear 'truncated' to us, are still produced and have some sort of role we don't yet understand.

ADD REPLY • link 4.0 years ago by Joe 21k