Unusual bacterial start codons when using tBLASTn
1
0
Entering edit mode
8.4 years ago
Jenez ▴ 540

Hello!

I have a quick question regarding bacterial start codons. I have downloaded the full plasmid database set of sequences from ncbi here: ftp://ftp.ncbi.nlm.nih.gov/genomes/Plasmids

I've made blast databases of the full nucleotide sequences of the plasmids (ffn) as well as of the resulting proteins (faa) of each plasmid. What I've done is to BLASTp against the coding proteins database using various input sequences, and used the resulting proteins to tBLASTn against the full plasmid nucleotide sequences to extract all the 100% identical nucleotide variants of these proteins. This works quite well, but in some cases the sequence returned does not have a typical start codon which you would expect. Normally, you'd expect to see either ATG, GTG, or TTG as the start codon. But here I am with a sequence whose start codon is AAA, coding for lysine, which results in my identity being less than 100%.

Now, my guess is that with automatic annotation the start codon is set to Methionine if the remaining sequence has a 100% match to whatever it is being used as the reference, but if this is the case or not I have no idea.

Could someone with a bit of experience chime in and help me out?

Thank you in advance!

Edit:

Turns out there was a minor bug in my code which disabled the reverse complementary function, causing the ends to flip. My apologies.

codon sequence blast alignment • 1.8k views
ADD COMMENT
1
Entering edit mode
8.4 years ago

Take a look to this DOCUMENT to verify that there are more than one genetic code

In fact, in this page there are defined up to 25 different genetic codes, some affecting primitiva bacteria

ADD COMMENT

Login before adding your answer.

Traffic: 2594 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6