Danskin's theorem

In convex analysis, Danskin's theorem is a theorem which provides information about the derivatives of a function of the form

$$f(x) = \max_{z \in Z} \phi(x, z).$$

The theorem has applications in optimization, where it is sometimes used to solve minimax problems. The original theorem by J. M. Danskin, given in his 1967 monograph "The Theory of Max-Min and its Applications to Weapons Allocation Problems" (Springer, NY), provides a formula for the directional derivative of the maximum of a (not necessarily convex) directionally differentiable function. When adapted to the case of a convex function, this formula yields the following theorem, given in somewhat more general form as Proposition A.22 in the 1971 Ph.D. thesis by D. P. Bertsekas, "Control of Uncertain Systems with a Set-Membership Description of the Uncertainty". A proof of the following version can be found in the 1999 book "Nonlinear Programming" by Bertsekas (Section B.5).

Statement

The theorem applies to the following situation. Suppose $\phi(x, z)$ is a continuous function of two arguments,

$$\phi : \mathbb{R}^n \times Z \to \mathbb{R},$$

where $Z \subset \mathbb{R}^m$ is a compact set. Further assume that $\phi(x, z)$ is convex in $x$ for every $z \in Z$.

Under these conditions, Danskin's theorem provides conclusions regarding the differentiability of the function

$$f(x) = \max_{z \in Z} \phi(x, z).$$

To state these results, we define the set of maximizing points $Z_0(x)$ as

$$Z_0(x) = \left\{ \bar{z} : \phi(x, \bar{z}) = \max_{z \in Z} \phi(x, z) \right\}.$$

Danskin's theorem then provides the following results.

Convexity

$f(x)$ is convex.
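The convexity claim follows from the fact that a pointwise maximum of convex functions is convex. As a short sketch, take any two points $x_1, x_2 \in \mathbb{R}^n$ and any $\lambda \in [0, 1]$ (these symbols are introduced here only for illustration). Using the convexity of $\phi(\cdot, z)$ for each fixed $z$:

$$
\begin{aligned}
f(\lambda x_1 + (1 - \lambda) x_2)
&= \max_{z \in Z} \phi(\lambda x_1 + (1 - \lambda) x_2, z) \\
&\le \max_{z \in Z} \left[ \lambda \phi(x_1, z) + (1 - \lambda) \phi(x_2, z) \right] \\
&\le \lambda \max_{z \in Z} \phi(x_1, z) + (1 - \lambda) \max_{z \in Z} \phi(x_2, z) \\
&= \lambda f(x_1) + (1 - \lambda) f(x_2).
\end{aligned}
$$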

Directional derivatives

The directional derivative of $f(x)$ in the direction $y$, denoted $D_y f(x)$, is given by

$$D_y f(x) = \max_{z \in Z_0(x)} \phi'(x, z; y),$$

where $\phi'(x, z; y)$ is the directional derivative of the function $\phi(\cdot, z)$ at $x$ in the direction $y$.
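The formula can be checked numerically on a small case. The sketch below assumes a hypothetical choice that is not from the source: a finite (hence compact) index set $Z = \{0, 1, 2\}$ and $\phi(x, z) = a_z^\top x$, which is linear and therefore convex in $x$. For this choice $\phi'(x, z; y) = a_z^\top y$, so the formula reduces to a maximum over the rows that are active at $x$. All function names are illustrative.

```python
import numpy as np

# Hypothetical example: phi(x, z) = A[z] @ x with Z = {0, 1, 2}.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [-1.0, -1.0]])

def f(x):
    """f(x) = max over z in Z of phi(x, z)."""
    return np.max(A @ x)

def maximizers(x, tol=1e-9):
    """Indices z in Z_0(x), i.e. those attaining the maximum (up to tol)."""
    vals = A @ x
    return np.flatnonzero(vals >= vals.max() - tol)

def danskin_directional_derivative(x, y):
    """max over z in Z_0(x) of phi'(x, z; y); here phi'(x, z; y) = A[z] @ y."""
    return np.max(A[maximizers(x)] @ y)

def one_sided_difference(x, y, t=1e-6):
    """Finite-difference estimate of the one-sided directional derivative."""
    return (f(x + t * y) - f(x)) / t

x = np.array([1.0, 1.0])   # rows 0 and 1 are both active at this point
for y in (np.array([1.0, -2.0]), np.array([-1.0, -2.0])):
    print(danskin_directional_derivative(x, y), one_sided_difference(x, y))
```

At the chosen point two rows attain the maximum, so $f$ is not differentiable there, yet the one-sided difference quotient and the formula should still agree in each direction.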

Derivative

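$f(x)$ is differentiable at $x$ if $Z_0(x)$ consists of a single element $\bar{z}$ and $\phi(\cdot, \bar{z})$ is differentiable at $x$. In this case, the derivative of $f(x)$ (or the gradient of $f(x)$ if $x$ is a vector) is given by

$$\frac{\partial f}{\partial x} = \frac{\partial \phi(x, \bar{z})}{\partial x}.$$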
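In practice this means the gradient of $f$ can be computed by first solving the inner maximization and then differentiating $\phi$ with respect to $x$ while holding the maximizer fixed. The sketch below assumes a hypothetical example not taken from the source: $\phi(x, z) = z^\top x - \tfrac{1}{2}\|z\|^2$ with $Z = [-1, 1]^n$. The function is linear in $x$ and strictly concave in $z$, so the maximizer is unique and is obtained by clipping $x$ to the box. The result is compared with a central finite difference.

```python
import numpy as np

def phi(x, z):
    """Hypothetical phi(x, z) = z.x - 0.5 * ||z||^2, linear (so convex) in x."""
    return z @ x - 0.5 * (z @ z)

def argmax_z(x):
    """Unique maximizer of phi(x, .) over the box Z = [-1, 1]^n."""
    return np.clip(x, -1.0, 1.0)

def f(x):
    """f(x) = max over z in Z of phi(x, z)."""
    return phi(x, argmax_z(x))

def danskin_gradient(x):
    """Gradient of phi in x at the fixed maximizer; for this phi it equals z_bar."""
    return argmax_z(x)

def central_difference_gradient(x, h=1e-6):
    """Numerical gradient of f, used only as a sanity check."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        e = np.zeros_like(x)
        e[i] = h
        grad[i] = (f(x + e) - f(x - e)) / (2.0 * h)
    return grad

x = np.array([0.3, -2.0, 1.5])
print(danskin_gradient(x))
print(central_difference_gradient(x))
```

Note that the maximizer itself depends on $x$, but no derivative of `argmax_z` is needed: the theorem says that dependence does not contribute to the gradient.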

Subdifferential

If $\phi(x, z)$ is differentiable with respect to $x$ for all $z \in Z$, and if $\partial \phi / \partial x$ is continuous with respect to $z$ for all $x$, then the subdifferential of $f(x)$ is given by

$$\partial f(x) = \operatorname{conv} \left\{ \frac{\partial \phi(x, z)}{\partial x} : z \in Z_0(x) \right\},$$

where $\operatorname{conv}$ indicates the convex hull operation.
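A standard one-dimensional illustration, chosen here as an example and not taken from the source, is the absolute value written as a maximum. Take $Z = [-1, 1]$ and $\phi(x, z) = z x$, so that

$$f(x) = \max_{z \in [-1, 1]} z x = |x|, \qquad \frac{\partial \phi(x, z)}{\partial x} = z.$$

For $x > 0$ the maximizer is unique, $Z_0(x) = \{1\}$, and the formula gives $\partial f(x) = \{1\}$; likewise $\partial f(x) = \{-1\}$ for $x < 0$. At $x = 0$ every $z$ attains the maximum, so $Z_0(0) = [-1, 1]$ and

$$\partial f(0) = \operatorname{conv} \left\{ z : z \in [-1, 1] \right\} = [-1, 1],$$

which is the familiar subdifferential of $|x|$ at the origin.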

Extension

The 1971 Ph.D. thesis by Bertsekas (Proposition A.22) proves a more general result, which does not require that $\phi(\cdot, z)$ is differentiable. Instead it assumes that $\phi(\cdot, z)$ is an extended real-valued closed proper convex function for each $z$ in the compact set $Z$, that $\operatorname{int}(\operatorname{dom}(f))$, the interior of the effective domain of $f$, is nonempty, and that $\phi$ is continuous on the set $\operatorname{int}(\operatorname{dom}(f)) \times Z$. Then for all $x$ in $\operatorname{int}(\operatorname{dom}(f))$, the subdifferential of $f$ at $x$ is given by

$$\partial f(x) = \operatorname{conv} \left\{ \partial \phi(x, z) : z \in Z_0(x) \right\},$$

where $\partial \phi(x, z)$ is the subdifferential of $\phi(\cdot, z)$ at $x$ for any $z$ in $Z$.

References

Danskin's theorem Wikipedia

(Text) CC BY-SA