This is not a question, just a small blog post. While developing a new version of Minia, I was wondering how assemblers filter out short contigs from their results. I thought maybe someone else could benefit from these observations.
- Return all contigs of length at least k+1, except:
- short (<= max(read length, 150 bp)) isolated contigs (those not connected to any other contig)
isolated ones with coverage at most 2actually that filter is disabled by default
(source: simplification.info and graph_simplification.hpp)
- Return all contigs of length at least 2*k
Minia v1 and v2: same as Velvet.
BTW not sure if this should be a Blog or Forum post. Or just not posted here :)