Hi to all, I have a short question: After aligning protein coding nucleotide sequences I sometimes find gaps within a codon. Are these gaps always sequencing errors, or can a protein coding nucleotide sequence actually have gaps in the codons? All the best Dieter
When aligning two sequences, you're trying to maximize some measure of similarity between them. If you allow them, gaps can sometimes improve the quality of the alignment. How to interpret these gaps depends on the context. For example, when comparing sequences from different species, you may attribute the gaps to evolution.
I am not sure what you mean by "can a protein coding nucleotide sequence actually have gaps in the codons". Nucleic acids don't have physical gaps if that's what you mean.
Hello, Thank you.
Yes, I knew that.
OK, I think I need to explain more detailed. A protein coding nucleotide sequence (DNA) is a sequence, which can be transcripted into a RNA, and this RNA can be translated into a protein. Such nucleotide sequence always consist of "codons" - a codon is a set of three nucleotides. You can read more about it here: https://en.wikipedia.org/wiki/Coding_region
No, that's not what I meant --> better explaination of what I menat follows below (sorry, I'm a very beginner)
All the best Dieter
Did you discover introns?
I'm pretty sure that Jean-Karim Heriche knows about codons, dna and transcription. I'm less sure about your biological understanding, or you need to explain it again because it doesn't make sense to me.
Hi, Thank you,
No, Introns need to be excludet from a protein coding region. You only use the exons.
Ah, OK - I'm very sorry for explaining it. Wait - I will prepare a picture to explain what I mean. All the best Dieter
Hi, Ok, I hope this explains my question: Here is a typical "CDS-alignment". These are exons only, already "cleaned": LINK In the middle you see such a single gap, which I sometimes find. My question is: Is this a real gap in that sequence, which really belongs to the sequence of that species, or is it just an error which happened while sequencing? All the best and thanks for your answers. All the best Dieter
It depends on the context but a single nucleotide gap in a column otherwise very conserved and in a sequence that's otherwise almost identical to the others could indeed suggest a sequencing error.