Hello guys,
I downloaded the NCBI files and created my local db. Everything seems to go well, but running the blastn -db /mnt/NTFS/NGS-DBs/NCBI-RefSeq/ViralSeq_2021-12-14 -query otus.fasta -evalue 1e-3 -word_size 11 -outfmt "6 staxid staxids" I obtain N/A and 0, respectively.
Here is the complete procedure I executed:
##Create the NCBI RefSeq viruses db
#Download sequences
genome_updater.sh -d "refseq" -g "viral" -c "all" -f "genomic.fna.gz" -o "all_virus_genomes" -t 4
#Create single fasta file containing all fasta sequences from *.fna.gz files
zcat *.fna.gz > viralseq_2021-12-14.fna
#Create BLASTN database
makeblastdb -in viralseq_20212-12-14.fna -dbtype nucl -title ViralSeq -input_type fasta -out ViralSeq_2021-12-14
Please, can someone help me to understand where is my mistake? How can I solve the problem? Thank you
Hello. Thank you very much for your help. I looked for the right file to download, but the NCBI repository looks like a jungle ... Please, can you suggest me which file to download? Thank you.
All needed info should be explained on this page: https://www.ncbi.nlm.nih.gov/sites/books/NBK569841/
Hello, as suggested by lieven.sterck I built the new db with the following command (I hope the nucl_gb.accession2taxid is the right file for mapping):
Then I repeated my blastn command, but nothing changed:
I also checked if, for example, the "AC_000019.1" is contained in the mapping file (nucl_gb.accession2taxid):
It is so.
Where is my mistake?
Please.
Thank you
Use
101010button to formatcode. First option (that you are using) is for quoting text.Downloaded taxonomy files either need to be in the same folder as your index or in folder designated with
$BLASTDBvariable. This works with version 5 (-blastdb_version 5) of blast database which I assume you are using if you are using the latestblast+package.