News:You may need to cleanup duplicate NCBI BLAST nt database volumes on your system
0
0
Entering edit mode
8 months ago
PeterC_NCBI ▴ 410

If you maintain the NCBI BLAST Nucleotide Database (nt) on your file system, you may need to remove duplicate volumes the next time you update it .

The nt database will soon be more than 100 volumes, and we'll move from two to three digit volume numbers (e.g., nt.00.tar.gz -> nt.000.tar.gz). The previous two digit volumes will remain on your system if you don't take steps to remove them.

You can use the script cleanup-blastdb-volumes.py that is included in the BLAST+ release to conveniently remove any duplicate volumes.

See the BLAST Command Line Applications User Manual for more details.

You can download BLAST databases and the BLAST+ release from the NCBI BLAST FTP area.

databases NCBI BLAST • 1.0k views
ADD COMMENT
2
Entering edit mode

Any reason not to go to 4-digit volume numbers so you don't have to deal with this issue again in our lifetimes?

ADD REPLY
2
Entering edit mode

Good point. I'll pass this idea along to the development team.

ADD REPLY
0
Entering edit mode

Thanks @mensur.Your idea may be a while in implementation since it would require code changes. The current code already accommodates three digit volumes. It automatically switches over when the number of volumes is above 100, There are currently 110 volumes for nt on the ftp site.

ADD REPLY

Login before adding your answer.

Traffic: 1518 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6