Question

Gap Continuation Penalty With Dynamic Programming ?

2

Entering edit mode

12.3 years ago

User 5037 ▴ 290

Hi. When there is a match score, mismatch score and gap penalty the problem of aligning sequences can be done using dynamic programming. Is it possible to use gap continuation penalty in aligning two sequences under the dynamic programming method?

• 7.5k views

ADD COMMENT • link updated 10.1 years ago by Biostar 20 • written 12.3 years ago by User 5037 ▴ 290

score 3 · Answer 1 · 2011-12-21

3

Entering edit mode

12.3 years ago

Michael Kuhn 5.0k

Yes, this is possible. For example, the gap extension penalty has been implemented in JAligner and you can check the source code to see how it's done (in the construct function).

ADD COMMENT • link 12.3 years ago by Michael Kuhn 5.0k

0

Entering edit mode

Iam interested in manualy performing it. On paper. Could you please explain it ?

ADD REPLY • link 12.3 years ago by User 5037 ▴ 290

0

Entering edit mode

please look at the linked source code to see how it is done

ADD REPLY • link 12.3 years ago by Michael Kuhn 5.0k

0

Entering edit mode

somebody please help me

ADD REPLY • link 12.3 years ago by User 5037 ▴ 290

0

Entering edit mode

Introducing gap has to be more penalized than just extending already existing gap. For example, choosing gap instead of penalty for mismatch is much more important than extending 12 gaps into 13 gaps. Thats why the differentiation is made.

ADD REPLY • link 12.2 years ago by Biomonika (Noolean) 3.2k

score 3 · Answer 2 · 2012-01-04

See the introductory slides here. I think you understand how to fill the DP matrix. For each cell in the DP matrix, we pick the max of three directions from three adjacent cells: UP, LEFT, DIAGONAL. UP and LEFT give you one gap, DIAGONAL give you match/mismatch.

Now the affine gap penalty makes the calculation more difficult. For each cell, we still pick the max of the three directions. But now since the gap score is not linear anymore (i.e. Two gaps != 2 x one gap), you'll have to consider all the cells on LEFT, all the cells on UP, rather than just the immediate neighbor.

This also increase the computational complexity from squared to cubic.

score 0 · Answer 3 · 2012-03-01

If you can read Java code, here is a clear implementation of affine gap scores for the Needleman-Wunsch algorithm (and some other versions too). You can find more material on the author's site. That implementation comes straightforward from explanations in the Biological sequence analysis textbook.

BTW, the computational complexity is still squared (although you have to keep three DP matrices in the memory).