![]() | ||
In mathematics, more specifically in multivariable calculus, the implicit function theorem is a tool that allows relations to be converted to functions of several real variables. It does so by representing the relation as the graph of a function. There may not be a single function whose graph can represent the entire relation, but there may be such a function on a restriction of the domain of the relation. The implicit function theorem gives a sufficient condition to ensure that there is such a function.
Contents
- History
- First example
- Definitions
- Statement of the theorem
- Regularity
- The circle example
- Application change of coordinates
- Example polar coordinates
- Banach space version
- Implicit functions from non differentiable functions
- References
The theorem states that if the equation F(x1, ..., xn, y1, ..., ym) = F(x, y) = 0 satisfies some mild conditions on its partial derivatives, then one can in principle (though not necessarily with an analytic expression) express the m variables yi in terms of the n variables xj as yi = fi(x), at least in some disk. Then each of these implicit functions fi(x), implied by F(x, y) = 0, is such that geometrically the locus defined by F(x, y) = 0 will coincide locally (that is in that disk) with the hypersurface given by y = f(x).
History
Augustin-Louis Cauchy (1789-1857) is credited with the first rigorous form of the implicit function theorem. Ulisse Dini (1845-1918) generalized the real-variable version of the implicit function theorem to the context of functions of any number of real variables.
First example
If we define the function
However, it is possible to represent part of the circle as the graph of a function of one variable. If we let
The purpose of the implicit function theorem is to tell us the existence of functions like
Definitions
Let f : Rn+m → Rm be a continuously differentiable function. We think of Rn+m as the Cartesian product Rn × Rm, and we write a point of this product as (x, y) = (x1, ..., xn, y1, ..., ym). Starting from the given function f, our goal is to construct a function g: Rn → Rm whose graph (x, g(x)) is precisely the set of all (x, y) such that f(x, y) = 0.
As noted above, this may not always be possible. We will therefore fix a point (a, b) = (a1, ..., an, b1, ..., bm) which satisfies f(a, b) = 0, and we will ask for a g that works near the point (a, b). In other words, we want an open set U of Rn containing a, an open set V of Rm containing b, and a function g : U → V such that the graph of g satisfies the relation f = 0 on U × V, and that no other points within U × V do so. In symbols,
To state the implicit function theorem, we need the Jacobian matrix of f, which is the matrix of the partial derivatives of f. Abbreviating (a1, ..., an, b1, ..., bm) to (a, b), the Jacobian matrix is
where X is the matrix of partial derivatives in the variables xi and Y is the matrix of partial derivatives in the variables yj. The implicit function theorem says that if Y is an invertible matrix, then there are U, V, and g as desired. Writing all the hypotheses together gives the following statement.
Statement of the theorem
Let f: Rn+m → Rm be a continuously differentiable function, and let Rn+m have coordinates (x, y). Fix a point (a, b) = (a1, ..., an, b1, ..., bm) with f(a, b) = c, where c ∈ Rm. If the Jacobian matrix Jf, y(a, b) = [(∂fi / ∂yj)(a, b)] is invertible, then there exists an open set U containing a, an open set V containing b, and a unique continuously differentiable function g: U → V such that
Regularity
It can be proven that whenever we have the additional hypothesis that f is continuously differentiable k times inside U × V, then the same holds true for the explicit function g inside U and
Similarly, if f is analytic inside U × V, then the same holds true for the explicit function g inside U. This generalization is called the analytic implicit function theorem.
The circle example
Let us go back to the example of the unit circle. In this case n = m = 1 and
Thus, here, the Y in the statement of the theorem is just the number 2b; the linear map defined by it is invertible iff b ≠ 0. By the implicit function theorem we see that we can locally write the circle in the form y = g(x) for all points where y ≠ 0. For (±1, 0) we run into trouble, as noted before. The implicit function theorem may still be applied to these two points, but writing x as a function of y, that is,
The implicit derivative of y with respect to x, and that of x with respect to y, can be found by totally differentiating the implicit function
giving
and
Application: change of coordinates
Suppose we have an m-dimensional space, parametrised by a set of coordinates
Now the Jacobian matrix of f at a certain point (a, b) [ where
where 1m denotes the m × m identity matrix, and J is the m × m matrix of partial derivatives, evaluated at (a, b). (In the above, these blocks were denoted by X and Y. As it happens, in this particular application of the theorem, neither matrix depends on a.) The implicit function theorem now states that we can locally express
Example: polar coordinates
As a simple application of the above, consider the plane, parametrised by polar coordinates (R, θ). We can go to a new coordinate system (cartesian coordinates) by defining functions x(R, θ) = R cos(θ) and y(R, θ) = R sin(θ). This makes it possible given any point (R, θ) to find corresponding cartesian coordinates (x, y). When can we go back and convert cartesian into polar coordinates? By the previous example, it is sufficient to have det J ≠ 0, with
Since det J = R, conversion back to polar coordinates is possible if R ≠ 0. So it remains to check the case R = 0. It is easy to see that in case R = 0, our coordinate transformation is not invertible: at the origin, the value of θ is not well-defined.
Banach space version
Based on the inverse function theorem in Banach spaces, it is possible to extend the implicit function theorem to Banach space valued mappings.
Let X, Y, Z be Banach spaces. Let the mapping f : X × Y → Z be continuously Fréchet differentiable. If
Implicit functions from non-differentiable functions
Various forms of the implicit function theorem exist for the case when the function f is not differentiable. It is standard that it holds in one dimension. The following more general form was proven by Kumagai based on an observation by Jittorntrum.
Consider a continuous function
where g is a continuous function from B0 into A0.