Lagrange inversion theorem - Alchetron, the free social encyclopedia

In mathematical analysis, the Lagrange inversion theorem, also known as the Lagrange–Bürmann formula, gives the Taylor series expansion of the inverse function of an analytic function.

Theorem statement

Suppose z is defined as a function of w by an equation of the form

f ( w ) = z

where f is analytic at a point a and f '(a) ≠ 0. Then it is possible to invert or solve the equation for w in the form of a series,

w = a + ∑ n = 1 ∞ g n ( z − f ( a ) ) n n ! ,

where

g n = lim w → a [ d n − 1 d w n − 1 ( w − a f ( w ) − f ( a ) ) n ] .

The theorem further states that this series has a non-zero radius of convergence, that is, that it represents an analytic function of z (which one may denote by g(z)) in a neighbourhood of z = f ( a ) . This is also called reversion of series.

If the assertions about analyticity are omitted, the formula is also valid for formal power series and can be generalized in various ways. It can be formulated for functions of several variables, it can be extended to provide a ready formula for F(g(z)) for any analytic function F, and it can be generalized to the case f '(a) = 0, where the inverse g is a multivalued function.

The theorem was proved by Lagrange and generalized by Hans Heinrich Bürmann, both in the late 18th century. There is a straightforward derivation using complex analysis and contour integration; the complex formal power series version is clearly a consequence of knowing the formula for polynomials, so the theory of analytic functions may be applied. Actually, the machinery from analytic function theory enters only in a formal way in this proof, in that what is really needed is just some property of the formal residue, and a more direct formal proof is available.

If f is a formal power series, then the above formula does not give the coefficients of the compositional inverse series g directly in terms for the coefficients of the series f. If we can express the functions f and g in formal power series as

f ( w ) = ∑ k = 0 ∞ f k w k k ! , a n d g ( z ) = ∑ k = 0 ∞ g k z k k ! ,

with f₀ = 0 and f₁ ≠ 0, then an explicit form of inverse coefficients can be given in term of Bell polynomials as

g n = 1 f 1 n ∑ k = 1 n − 1 ( − 1 ) k n ( k ) B n − 1 , k ( f ^ 1 , f ^ 2 , … , f ^ n − k ) , n ≥ 2 ,

with f ^ k = f k + 1 ( k + 1 ) f 1 , g 1 = 1 f 1 , and n ( k ) = n ( n + 1 ) ⋯ ( n + k − 1 ) , being the rising factorial.

Example

For instance, the algebraic equation of degree p

x p − x + z = 0

can be solved for x by means of the Lagrange inversion formula for the function f(x) = x − x^p, yielding to a formal series solution

x = ∑ k = 0 ∞ ( p k k ) z ( p − 1 ) k + 1 ( p − 1 ) k + 1 .

By convergence tests, this series is in fact convergent for |z| ≤ (p − 1)p^{−p/(p − 1)}, which is also the largest disk in which a local inverse to f can be defined.

Lagrange–Bürmann formula

There is a special case of Lagrange inversion theorem that is used in combinatorics and applies when f ( w ) = w / ϕ ( w ) for some analytic ϕ ( w ) with ϕ ( 0 ) ≠ 0. Take a = 0 to obtain f ( a ) = f ( 0 ) = 0. We have

g ( z ) = ∑ n = 1 ∞ ( lim w → 0 ( d n − 1 d w n − 1 ( w w / ϕ ( w ) ) n ) z n n ! ) = ∑ n = 1 ∞ 1 n ( 1 ( n − 1 ) ! lim w → 0 ( d n − 1 d w n − 1 ϕ ( w ) n ) ) z n ,

which can be written alternatively as

[ z n ] g ( z ) = 1 n [ w n − 1 ] ϕ ( w ) n ,

where [ w r ] is an operator which extracts the coefficient of w r in the Taylor series of a function of w.

A useful generalization of the formula is known as the Lagrange–Bürmann formula:

[ z n ] H ( g ( z ) ) = 1 n [ w n − 1 ] ( H ′ ( w ) ϕ ( w ) n )

where H can be an arbitrary analytic function, e.g. H(w) = w^k.

Sometimes, the derivative H' (w) can be quite complicated. A simpler version of the formula replaces H' (w) with H (w)(1-φ'(w)/φ(w)) to get

[ z n ] H ( g ( z ) ) = [ w n ] H ( w ) ϕ ( w ) n − 1 ( ϕ ( w ) − w ϕ ′ ( w ) ) ,

which involves φ'(w) instead of H' (w).

Lambert W function

The Lambert W function is the function W ( z ) that is implicitly defined by the equation

W ( z ) e W ( z ) = z .

We may use the theorem to compute the Taylor series of W ( z ) at z = 0. We take f ( w ) = w e w and a = b = 0. Recognizing that

d n d x n e α x = α n e α x

this gives

W ( z ) = ∑ n = 1 ∞ lim w → 0 ( d n − 1 d w n − 1 e − n w ) z n n ! = ∑ n = 1 ∞ ( − n ) n − 1 z n n ! = z − z 2 + 3 2 z 3 − 8 3 z 4 + O ( z 5 ) .

The radius of convergence of this series is e − 1 (this example refers to the principal branch of the Lambert function).

A series that converges for larger z (though not for all z) can also be derived by series inversion. The function f ( z ) = W ( e z ) − 1 satisfies the equation

1 + f ( z ) + ln ⁡ ( 1 + f ( z ) ) = z .

Then z + ln ⁡ ( 1 + z ) can be expanded into a power series and inverted. This gives a series for f ( z + 1 ) = W ( e z + 1 ) − 1 :

W ( e 1 + z ) = 1 + z 2 + z 2 16 − z 3 192 − z 4 3072 + 13 z 5 61440 − 47 z 6 1474560 − 73 z 7 41287680 + 2447 z 8 1321205760 + O ( z 9 ) .

W ( x ) can be computed by substituting ln ⁡ x − 1 for z in the above series. For example, substituting −1 for z gives the value of W ( 1 ) = 0.567143 .

Binary trees

Consider the set B of unlabelled binary trees. An element of B is either a leaf of size zero, or a root node with two subtrees. Denote by B n the number of binary trees on n nodes.

Note that removing the root splits a binary tree into two trees of smaller size. This yields the functional equation on the generating function B ( z ) = ∑ n = 0 ∞ B n z n :

B ( z ) = 1 + z B ( z ) 2 .

Now let C ( z ) = B ( z ) − 1 , one has thus C ( z ) = z ( C ( z ) + 1 ) 2 . Now apply the theorem with ϕ ( w ) = ( w + 1 ) 2 :

B n = [ z n ] C ( z ) = 1 n [ w n − 1 ] ( w + 1 ) 2 n = 1 n ( 2 n n − 1 ) = 1 n + 1 ( 2 n n ) .

We conclude that B n is the Catalan number.

Asymptotic approximation of integrals

In the Laplace-Erdelyi theorem that gives the asymptotic approximation for Laplace-type integrals, the function inversion is taken as a crucial step.

References

Lagrange inversion theorem Wikipedia

(Text) CC BY-SA

Contents