Blast database on SSD?
1
0
Entering edit mode
6.4 years ago

I'm trying to search a large batch of sequences against the nr database with locally installed blast, running blastx with -task blastx-fast. I've split the file into batches of a few thousand sequences to run them in parallel, it's going to take weeks at this rate. Might the search proceed faster if the nr database was kept on an SSD drive or stick rather than on an ordinary hard drive?

blast • 1.8k views
ADD COMMENT
0
Entering edit mode

Perhaps. But having plenty of RAM (~40G) and DIAMOND may be something you should look at. You will need to create DIAMOND blast indexes for nr. A normal stick (if you mean a plain USB drive) is not going to cut it at all.

ADD REPLY
0
Entering edit mode

Can you alter your workflow any so that you don’t have to brute force it? Maybe you can cluster the sequences first or use HMMs to reduce your dataset size?

ADD REPLY
1
Entering edit mode
6.4 years ago

Update on the question: I've transferred the database to an SSD drive which yielded a roughly fivefold increase in speed with number of threads, input size and memory being equal. Worth trying if there's a large number of sequences to search.

ADD COMMENT

Login before adding your answer.

Traffic: 1950 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6