Hi, does anyone know where on the NCBI FTP site I can find the GenBank (gbk) files for artificial sequences (i.e. plasmid vectors and the like)? There is a directory for genomes, but I can't figure out where these are located.
Thanks a lot.
Are you looking for UniVec?
Well, I did check UniVec, but it is in FASTA format, which is useless to me. Thanks though.
I'm not very familiar with GenBank, but at ENA (the European Nucleotide Archive), the artificial sequences should be in the files that carry the SYN (synthetic) taxonomic division label.
According to the release notes for the current ENA release (version 120), there are 8 such files on the ENA FTP site:
320 rel_pat_syn_01_r120.dat Patent Sequences
321 rel_pat_syn_02_r120.dat Patent Sequences
322 rel_pat_syn_03_r120.dat Patent Sequences
323 rel_pat_syn_04_r120.dat Patent Sequences
324 rel_pat_syn_05_r120.dat Patent Sequences
325 rel_pat_syn_06_r120.dat Patent Sequences
326 rel_pat_syn_07_r120.dat Patent Sequences
435 rel_std_syn_01_r120.dat Standard Sequences
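If you have a longer listing of release files and want to pick out the synthetic ones automatically, a rough sketch like the following might help. It assumes the names follow the pattern visible in the list above (rel_<class>_<division>_<part>_r<release>.dat); the function name `synthetic_files` and the non-synthetic example name are made up for illustration and are not taken from the release notes.

```python
import re

# Pattern inferred from the file names listed above (an assumption, not an
# official specification): rel_<class>_<division>_<part>_r<release>.dat
NAME_RE = re.compile(
    r"^rel_(?P<cls>[a-z]+)_(?P<div>[a-z]+)_(?P<part>\d+)_r(?P<rel>\d+)\.dat$"
)

def synthetic_files(names, division="syn"):
    """Return the file names whose division field equals `division`."""
    hits = []
    for name in names:
        name = name.strip()
        match = NAME_RE.match(name)
        if match and match.group("div") == division:
            hits.append(name)
    return hits

listing = [
    "rel_pat_syn_01_r120.dat",
    "rel_std_syn_01_r120.dat",
    "rel_std_xyz_01_r120.dat",  # hypothetical non-synthetic name, for contrast
]
print(synthetic_files(listing))
```

Note that these .dat files are in EMBL flat-file format rather than GenBank format, so they would still need to be converted if you specifically need gbk.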
ENA is the European counterpart of GenBank. Both are part of the INSDC (the International Nucleotide Sequence Database Collaboration) and contain the same information.
Hope this helps.