Question: makeblastdb.exe Crashes when using a cleaned FASTA file
gravatar for friedrichlab
23 months ago by
friedrichlab0 wrote:

Using bedtools2, I extracted the CDS fasta from a GFF3 file and it's reference fasta. I then attempted to create a local database using blast command line and the 'makeblastdb' command. This failed due to a large number of duplicated sequences in the file. I then ran the fasta through the sequence cleaner python script found here:

Now, with my newly cleaned fasta, I am again attempting to create a custom database. In windows command prompt, running the command:

makeblastdb -in clear_transcriptome.fa -out clear_transcriptome -dbtype nucl -parse_seqids

Causes a window to open saying "makeblastdb.exe has stopped working".

I have attempted a fresh install of Blast command line, and have successfully built a database with a different fasta file (that has worked in the past), but this error is still occurring.

Additionally, running the exact same command WITHOUT the '-parse_seqids' option successfully builds a database, but not one that I can blast against (tblastn returns "No alias or index file found for nucleotide database").

For refrenece, here is the format of the first few sequences in my clear_transcriptome.fa file (the ellipses are only for space conservation, as this post is long enough already):









I will admit it is not the cleanest, but that is a result of the sequence cleaner python script.

Any idea as to what about the sequence ids is causing the crashes? Could it be the excessive length of some of them? If so, why would that cause this issue? If any more information is needed let me know.

Thanks in advance.

ADD COMMENTlink modified 23 months ago • written 23 months ago by friedrichlab0
gravatar for genomax
23 months ago by
United States
genomax69k wrote:

Could it be the excessive length of some of them?

That could be one of the issues. You could shorten the headers by using something here: Fasta header trimming

ADD COMMENTlink written 23 months ago by genomax69k

That worked perfectly! Thank you very much!

ADD REPLYlink written 23 months ago by friedrichlab0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 601 users visited in the last hour