I work on E. coli genomes and while going through the various genes present it, I have seen that in the coordinates area of the description it is suggested to join different regions of genome.
I was wondering why is this so? Isn't a gene sequence meant to be continuous.
Another thing that I am not clear about is the difference between a gene and CDS - I see cases in E. coli (based on NCBI annotation file in FTP site) where mutiple CDS are present with gene, some times they overlap also. And in most of the cases the gene and the corresponding CDS have the same coordinates.
Please somebody clarify me regarding these two things.