How to get BLAST wgs database
1
1
Entering edit mode
2.7 years ago
Michael ▴ 240

I would like to "mimic" the online BLASTn wgs search (NOT nt, this works already) on my local machine. The reason is that I want to extend from the BLAST nt search to a broader range of bacterial strains.

It seems like there is no BLAST wgs database to download. How do I determine which genomes I have to download to catch the genomes in wgs. I only need it for bacteria for a start.

An alternative would be to not use the entire wgs but some method to at least "broaden" the search compared to BLAST nt.

BLAST wgs • 2.0k views
ADD COMMENT
3
Entering edit mode
2.7 years ago
GenoMax 111k

Did you take a look at this README file NCBI provides for instructions on how to deal with WGS data with blast+? Scripts you need are in this directory. Bacteria are taxid 2.

ADD COMMENT
0
Entering edit mode

That's really cool! This means that the data is first downloaded to the local machine to execute the search, right? So the search is not performed on NCBI servers?

ADD REPLY
0
Entering edit mode

You should check that by using a specific bacterial taxid first (e.g. Pseudomonas etc). If the data is downloaded locally then you had better have access to plenty of local storage :-)

ADD REPLY
0
Entering edit mode

(credit to https://github.com/khyox/draftGenomes/issues/1)

IIUC, taxid2wgs doesn't report all wgs records. for example:

perl taxid2wgs.pl 90371 | grep AAJDPJ

would return nothing, even though https://www.ncbi.nlm.nih.gov/assembly/GCA_007903735.1 exists.

ADD REPLY

Login before adding your answer.

Traffic: 2467 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6