diamond seems to run out of memory
1
0
Entering edit mode
3.0 years ago
Ric ▴ 440

Hi, I ran diamond on a 128GB ram computer with a 2.8Gb genome. However, it appears it ran out of memory. Would there be a way to reduce the amount of memory?

$ diamond makedb --in GenbankAA.fasta --db diamond -p 8
$ diamond blastx --threads 1 -q genome.softmasked.fasta --db diamond -o diamond.matches.tab -e 1e-10 -k 0 --more-sensitive -f 6 sseqid slen sstart send qseqid qlen qstart qend pident length evalue score qcovhsp qframe
...
Searching alignments...  [0.602s]
Deallocating buffers...  [0s]
Clearing query masking...  [0.247s]
Computing alignments... Error: std::bad_alloc

Thank you in advance

alignment gene • 2.8k views
ADD COMMENT
3
Entering edit mode
3.0 years ago
buchfink ▴ 250

Diamond is limited in processing that long of a query sequence in blastx mode. One thing you can try is to use the frameshift mode (option -F 15). Otherwise, you probably should extract ORFs from the genome and search those.

ADD COMMENT
0
Entering edit mode
  1. Thank you, -F 15 worked. What does frameshift mode mean and why 15 has been chosen?
  2. How would you extract ORFs from the genome?
ADD REPLY
0
Entering edit mode

You will need to predict genes and then extract the sequences.

ADD REPLY
0
Entering edit mode

Thank you.

ADD REPLY
0
Entering edit mode
  1. This mode is able to compute alignments that contain a change in the reading frame. 15 is the penalty and only a recommended value. The mode was also designed to handle longer sequences.
  2. There are tools for doing this, I'm sure google can help.
ADD REPLY
0
Entering edit mode

I created for each of the 19 chromosomes a FASTA file. Each of the 19 FASTA files is around 150Mb. Is frameshift mode (option -F 15) a good value?

ADD REPLY
0
Entering edit mode

I have not tested it on this kind of data but you can surely try.

ADD REPLY

Login before adding your answer.

Traffic: 1716 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6