I would like to have an overview of all the formats (types) that an accession number can have.
So far I have been working with accession numbers consisting of
3 letters + 5 numerals,
3 letters + 7 numerals, or
2 letters + an underscore + 9 numerals
(e.g. MAM88718, MBA1146895 or WP_088983797).
But now I have encountered some with only
1 numeral + 3 letters (e.g. 5YFE). That looks like a
PDB ID to me rather than an accession number, but I got that reference from a
search in the non-redundant NCBI database.
I would like to know where these IDs come from.
Does anyone know what kind of accession number this is?
Some time ago I found a nice list that gave an overview of all kinds of accession numbers, like:
Nucleotide: 1 letter + 5 numerals OR 2 letters + 6 numerals
Protein: 3 letters + 5 numerals
WGS: 4 letters + 2 numerals for WGS assembly version + 6-8 numerals
MGA: 5 letters + 7 numerals
... just like here: https://letgen.org/GGGB%20Genetics%20&%20Bioinformatics/accession-number/ or here: https://academic.oup.com/nar/article/44/D1/D733/2502674 (but neither of those is the website I remember - it was nicer).
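
To make the question more concrete, here is a minimal sketch of how the formats mentioned in this post could be checked with regular expressions. The patterns only encode the formats listed above (plus the PDB-like one I am asking about); they are not an official or complete list, and the function name `classify_accession` and the labels are just my own illustration.

```python
import re

# Patterns written from the formats described in this post only.
# This is NOT an official or complete list of accession formats.
PATTERNS = [
    ("protein (3 letters + 5 numerals)", re.compile(r"[A-Z]{3}\d{5}")),
    ("protein (3 letters + 7 numerals)", re.compile(r"[A-Z]{3}\d{7}")),
    ("RefSeq-style (2 letters + underscore + numerals)", re.compile(r"[A-Z]{2}_\d+")),
    ("nucleotide (1 letter + 5 numerals)", re.compile(r"[A-Z]\d{5}")),
    ("nucleotide (2 letters + 6 numerals)", re.compile(r"[A-Z]{2}\d{6}")),
    ("WGS (4 letters + 2 numerals + 6-8 numerals)", re.compile(r"[A-Z]{4}\d{2}\d{6,8}")),
    ("MGA (5 letters + 7 numerals)", re.compile(r"[A-Z]{5}\d{7}")),
    ("PDB-like ID (1 numeral + 3 letters/numerals)", re.compile(r"\d[A-Z0-9]{3}")),
]


def classify_accession(acc):
    """Return the label of the first pattern that matches the whole ID."""
    # Normalise case and drop a trailing version suffix such as ".1".
    core = acc.strip().upper().split(".")[0]
    for label, pattern in PATTERNS:
        if pattern.fullmatch(core):
            return label
    return "unknown"


if __name__ == "__main__":
    for acc in ["MAM88718", "MBA1146895", "WP_088983797.1", "5YFE"]:
        print(acc, "->", classify_accession(acc))
```

With this, 5YFE falls into the PDB-like group and the other three examples into the protein or RefSeq-style groups, which is exactly the distinction I would like someone to confirm or correct.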