6.3 years ago
melisechaves
I downloaded the proteomes of 2,500 bacterial strains deposited in GenBank. Each strain has its own directory, which contains separate files for the chromosome and for each plasmid. However, the only way I can tell whether a file corresponds to the chromosome or to a plasmid is to look up its code on the GenBank website and read the description. How could I write a script that separates the chromosomal proteins from the plasmid proteins into just two files?
Can you paste an example of how these records appear on GenBank? Also, can you paste an example of the directory structure that you have?
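Until those details are posted, here is only a rough sketch of one way the split could work. It assumes things the question does not confirm: that each replicon is a FASTA file ending in `.faa`, that the file name without its extension is the GenBank code, and that you already have a table mapping each code to its GenBank description. The names `is_plasmid` and `split_proteomes` are made up for illustration, and the rule "description contains the word plasmid" is a guess that should be checked against real records.

```python
from pathlib import Path


def is_plasmid(description: str) -> bool:
    # Assumption: plasmid records mention "plasmid" in their description.
    return "plasmid" in description.lower()


def split_proteomes(root, descriptions, chrom_out, plasmid_out, pattern="*.faa"):
    """Concatenate every FASTA file under root into one of two output files.

    descriptions maps a file stem (assumed to be the GenBank code) to its
    description. Files with no entry are returned instead of being guessed.
    Write the two output files outside root so they are not picked up again.
    """
    unknown = []
    with open(chrom_out, "w") as chrom, open(plasmid_out, "w") as plasmid:
        for fasta in sorted(Path(root).rglob(pattern)):
            description = descriptions.get(fasta.stem)
            if description is None:
                unknown.append(fasta)
                continue
            text = fasta.read_text()
            # Make sure the next file's header starts on its own line.
            if text and not text.endswith("\n"):
                text += "\n"
            target = plasmid if is_plasmid(description) else chrom
            target.write(text)
    return unknown
```

A call might look like `split_proteomes("strains", descriptions, "chromosome.faa", "plasmid.faa")`, where `strains` and both output names are placeholders. How the `descriptions` table gets built depends on what the downloaded files actually contain, which is why the example records and directory listing would help.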