Hi there, Is there any software tool or database to identify the entire 5' and 3' UTR regions of bacterial gene? I am aware that eukaryotic genes are clearly annotated in Ensembl and Genbank with these details. But unfortunately I couldnt able to find this information for bacterial genes. Your help on this would be very much appreciated. Many thanks in advance.
I am not a bacterial expert at all, but conceptually you can check for these UTRs manually. In eukaryotes, the 5'UTR is defined as the sequence from the beginning of exon 1 to the base right upstream of the start codon. Likewise, the 3'UTR is the base right downstream after the stop codon until the end of the last exon. In prokaryotes, you could try to take the gene/operon annotation you have and define 5'UTR as the entire range from the beginning of the gene until the start codon and the 3'UTR as the range after the stop codon until the end of the gene.
Thanks.. I believe you mean by "range from the beginning of the gene..." actually "range from the beginning of the transcript...". But my problem begins how to define / identify / the beginning of the transcript and end of the transcript.. As explained below, my aim is to find accessible open areas in the mRNA secondary structures of a few bacterial genes (Ex. Alr, dxr) to identify Antisense oligonucleotides (ASOs) binding target. There are few programmes available to predict the secondary structure but we need to key in exactly entire mRNA CDS + 5' + 3' UTRs otherwise the secondary structure prediction won't be correct and we end up designing ASOs for wrong inaccessible area.
I had done this exactly a few years back but for Eukaryotic (huntington, DMD etc.) genes.. the advantage of Eukaryotic genes are that they are properly annotated in the genome databases including the UTR regions.. but for prokaryotes no such thing available..