I want to create a local database using the ncbi database ftp files, am using clc main work bench to analyse my sequences (proteins), for some resean it does not connect to the ncbi serve to download the ncbi through the software, so i decided to download them manually from the server, i have tried to create a folder in the ncbi and make a local database but the software does not recognise the files , i think because they are already formated, so how can create a local database use the ftp files from ncbi
I use CLC as a primary tool for all my bioinformatic analysis. Ever since KLAST was disabled on CLC, I have been using the BLAST+ tool taht is built in. While CLC has a BLAST database manager that only allows pre-formated databases from the ://ftp.ncbi.nlm.nih.gov/blast/db/ FTP site. The databases are vast and extensive, it takes too much time to run. I found custom databases from ftp://ftp.ncbi.nlm.nih.gov/refseq/release/viral/.
To answer your question about custom local databases. Because CLC uses BLAST+ program. I was able to use the BLAST+ STANDALONE to format all my local databases.
(I am using windows)
First download BLAST+ on your machine from this site (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/LATEST/)
follow the instruction to add /db/ folder.
Second format custom local databases
download any database with a .gz file.
add it to you /db/ directory and run the following commands to create the database.
Essentially converts them from .faa to the appropriate .pin, .nhr, .psi, etc...
For nucleotide fasta file: makeblastdb -in input_db -dbtype nucl -parse_seqids
For protein fasta file: makeblastdb -in input_db -dbtype prot -parse_seqids
Once formated, Simply restart CLC and refresh blast database manager and now you should have your local databases.
CREDIT source: ftp://ftp.hgc.jp/pub/mirror/ncbi/blast/documents/blastdb.html