I'm performing de novo transcriptome assembly from plant RNA-seq data using two strategies (a multiple k-mer approach and a single k-mer approach using different assemblers) and then merging the resulting pre-assemblies. Now I'm trying to remove redundancy with CD-HIT-EST (CD-HIT v4.6), but the program stops with an error.
The end of the output looks like this:
228085 finished 175242 clusters
Apprixmated maximum memory consumption: 455M
writing new database
file opening failed
Program halted !!
Can someone help me understand what is going wrong?