Entering edit mode
10.2 years ago
PS
▴
30
NCBI has a genomes directory on their ftp site which contains genome sequence and annotation for different organisms.
- Are all the sequences in this directory also found in the BLAST nt database? I'm especially interested in knowing if Bacteria and Bacteria_DRAFT sequences are in nt. I know RefSeq and Genbank sequences are in nt, but the README seems to indicate that there are additional sequences in this directory as well.
- Are all sequences in Genbank in this directory? Again, I was confused by the wording that indicates that finished genomes submitted to Genbank that have no additional processing by NCBI are in the genbank directory, but does that mean they are not also in the genomes directory?
- What is BACTERIA_ASSEMBLY in relation to Bacteria_DRAFT?