I am trying to optimise bwa mem in order to find the most stringent alignment settings - prioritising accuracy over speed. (See related post).
I am trying to find a set of parameters that would:
- Set the probability of a mismatch/substitution/insertion being detected as equal
- Highly penalise the extension of gaps (insertions and deletions)
- Optimise for accuracy over speed
Here is what I'm currently using:
bwa mem \
-r1 \ # re-seeding trigger - default [1.5]
-B6 \ # Mismatch penalty - default [4]
-O6 \ # Gap open penalty - default [6]
-E10 \ # Gap extension penalty - default [1]
-H \ # Hard clipping - default [F]
${genome} \
${reads}
Does anyone have any suggestions for how I can optimise this to best fit my priorities?
Since the settings may be somewhat dependent on data at hand you may have to experiment yourself. Read length, size of database, low diversity are all going to affect final result.