Supriya Ghosh (Editor)

Penalty method

Penalty methods are a class of algorithms for solving constrained optimization problems.

A penalty method replaces a constrained optimization problem with a series of unconstrained problems whose solutions ideally converge to the solution of the original constrained problem. Each unconstrained problem is formed by adding to the objective function a term, called a penalty function, that consists of a penalty parameter multiplied by a measure of violation of the constraints. The measure of violation is nonzero when the constraints are violated and zero in the region where they are satisfied.

Example

Let us say we are solving the following constrained problem:

$$\min f(x)$$

subject to

$$c_i(x) \le 0 \quad \forall i \in I.$$

This problem can be solved as a series of unconstrained minimization problems

$$\min \Phi_k(x) = f(x) + \sigma_k \sum_{i \in I} g(c_i(x))$$

where

$$g(c_i(x)) = \max(0, c_i(x))^2.$$

In the above equations, $g(c_i(x))$ is the penalty function and the $\sigma_k$ are the penalty coefficients. In each iteration $k$ of the method, we increase the penalty coefficient $\sigma_k$ (e.g. by a factor of 10), solve the unconstrained problem, and use the solution as the initial guess for the next iteration. Under suitable conditions, the solutions of the successive unconstrained problems converge to the solution of the original constrained problem as $\sigma_k$ grows.
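The iteration described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the toy problem (minimize x^2 subject to 1 - x <= 0), the function names `golden_section_minimize` and `penalty_method`, the `radius` parameter, and the use of golden-section search as the unconstrained solver are all assumptions made for this example. The warm start is realized by centering each search interval on the previous solution.

```python
import math


def golden_section_minimize(phi, lo, hi, tol=1e-10):
    """Minimize a unimodal 1-D function phi on [lo, hi] (hypothetical helper)."""
    r = (math.sqrt(5.0) - 1.0) / 2.0  # inverse golden ratio, about 0.618
    a, b = lo, hi
    c = b - r * (b - a)
    d = a + r * (b - a)
    fc, fd = phi(c), phi(d)
    while b - a > tol:
        if fc < fd:
            # Minimum lies in [a, d]; the old c becomes the new d.
            b, d, fd = d, c, fc
            c = b - r * (b - a)
            fc = phi(c)
        else:
            # Minimum lies in [c, b]; the old d becomes the new c.
            a, c, fc = c, d, fd
            d = a + r * (b - a)
            fd = phi(d)
    return 0.5 * (a + b)


def penalty_method(f, constraints, x0, sigma0=1.0, factor=10.0,
                   iterations=8, radius=2.0):
    """Quadratic penalty method for constraints of the form c_i(x) <= 0."""
    x, sigma = x0, sigma0
    history = []
    for _ in range(iterations):
        def phi(z, s=sigma):
            # Objective plus sigma_k times the sum of max(0, c_i(z))^2.
            return f(z) + s * sum(max(0.0, c(z)) ** 2 for c in constraints)

        # Warm start: search an interval centered on the previous solution.
        x = golden_section_minimize(phi, x - radius, x + radius)
        history.append((sigma, x))
        sigma *= factor  # increase the penalty coefficient
    return x, history


def objective(x):
    return x * x


def constraint(x):
    return 1.0 - x  # feasible when 1 - x <= 0, i.e. x >= 1


x_star, history = penalty_method(objective, [constraint], x0=0.0)
for sigma, x in history:
    print(f"sigma = {sigma:g}: x = {x:.7f}")
```

For this toy problem the minimizer of each penalized objective can be worked out by hand as $\sigma_k / (1 + \sigma_k)$, so the printed iterates approach the constrained solution x = 1 from the infeasible side without ever reaching it. That behavior is typical of exterior penalty functions.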

Practical application

Image compression optimization algorithms can make use of penalty functions for selecting how best to compress zones of colour to single representative values.

Barrier methods

Barrier methods constitute an alternative class of algorithms for constrained optimization. These methods also add a penalty-like term to the objective function, but in this case the iterates are forced to remain interior to the feasible domain and the barrier is in place to bias the iterates to remain away from the boundary of the feasible region.
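As a worked illustration, one common choice of barrier term is the logarithmic barrier. The article does not name a specific barrier, so the formulation below is an assumption, with $\mu > 0$ as a hypothetical barrier parameter that is decreased toward zero:

$$\min B_\mu(x) = f(x) - \mu \sum_{i \in I} \log(-c_i(x)).$$

For the toy problem $f(x) = x^2$ with the single constraint $c(x) = 1 - x \le 0$, setting the derivative of $B_\mu(x) = x^2 - \mu \log(x - 1)$ to zero gives

$$2x - \frac{\mu}{x - 1} = 0 \quad \Longrightarrow \quad x(\mu) = \frac{1 + \sqrt{1 + 2\mu}}{2}.$$

Every $x(\mu)$ is strictly greater than 1, so the iterates stay in the interior of the feasible region, and $x(\mu) \to 1$ as $\mu \to 0^+$. A quadratic penalty applied to the same problem would instead approach $x = 1$ from the infeasible side.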
