In numerical analysis, the Durand–Kerner method, discovered by Karl Weierstrass in 1891 and rediscovered independently by Durand in 1960 and Kerner in 1966, is a root-finding algorithm for solving polynomial equations. In other words, the method can be used to solve numerically the equation
Contents
- Explanation
- Variations
- Example
- Derivation of the method via Newtons method
- Root inclusion via Gerschgorins circles
- Convergence results
- References
where ƒ is a given polynomial, which can be taken to be scaled so that the leading coefficient is 1.
Explanation
The explanation is for equations of degree four. It is easily generalized to other degrees.
Let the polynomial ƒ be defined by
ƒ(x) = x4 + ax3 + bx2 + cx + dfor all x.
The known numbers a, b, c, d are the coefficients.
Let the (complex) numbers P,Q,R,S be the roots of this polynomial ƒ.
Then
ƒ(x) = (x − P)(x − Q)(x − R)(x − S)for all x. One can isolate the value P from this equation,
So if used as a fixed point iteration
it is strongly stable in that every initial point x0 ≠ Q,R,S delivers after one iteration the root P=x1.
Furthermore, if one replaces the zeros Q, R and S by approximations q ≈ Q, r ≈ R, s ≈ S, such that q,r,s are not equal to P, then P is still a fixed point of the perturbed fixed point iteration
since
Note that the denominator is still different from zero. This fixed point iteration is a contraction mapping for x around P.
The clue to the method now is to combine the fixed point iteration for P with similar iterations for Q,R,S into a simultaneous iteration for all roots.
Initialize p, q, r, s:
p0 := (0.4 + 0.9 i)0 ;q0 := (0.4 + 0.9 i)1 ;r0 := (0.4 + 0.9 i)2 ;s0 := (0.4 + 0.9 i)3 ;There is nothing special about choosing 0.4 + 0.9 i except that it is neither a real number nor a root of unity.
Make the substitutions for n = 1,2,3,···
Re-iterate until the numbers p, q, r, s stop essentially changing in relative to the desired precision. Then they have the values P, Q, R, S in some order and in the chosen precision. So the problem is solved.
Note that you must use complex number arithmetic, and that the roots are found simultaneously rather than one at a time.
Variations
This iteration procedure, like the Gauss–Seidel method for linear equations, computes one number at a time based on the already computed numbers. A variant of this procedure, like the Jacobi method, computes a vector of root approximations at a time. Both variant are effective root-finding algorithms.
One could also choose the initial values for p,q,r,s by some other procedure, even randomly, but in a way that
and that
which may increasingly become a concern as the degree of the polynomial increases.
If the coefficients are real and the polynomial has odd degree, then it must have at least one real root. To find this, use a real value of p0 as the initial guess and make q0 and r0, etc, complex conjugate pairs. Then the iteration will preserve these properties; that is, pn will always be real, and qn and rn, etc, will always be conjugate. In this way, the pn will converge to a real root P. Alternatively, make all of the initial guesses real; they will remain so.
Example
This example is from the reference 1992. The equation solved is x3 − 3x2 + 3x − 5 = 0. The first 4 iterations move p, q, r seemingly chaotically, but then the roots are located to 1 decimal. After iteration number 5 we have 4 correct decimals, and the subsequent iteration number 6 confirms that the computed roots are fixed. This general behaviour is characteristic for the method.
Note that the equation has one real root and one pair of complex conjugate roots, and that the sum of the roots is 3.
Derivation of the method via Newton's method
For every n-tuple of complex numbers, there is exactly one monic polynomial of degree n that has them as its zeros (keeping multiplicities). This polynomial is given by multiplying all the corresponding linear factors, that is
This polynomial has coefficients that depend on the prescribed zeros,
Those coefficients are, up to a sign, the elementary symmetric polynomials
To find all the roots of a given polynomial
The Durand–Kerner method is obtained as the multidimensional Newton's method applied to this system. It is algebraically more comfortable to treat those identities of coefficients as the identity of the corresponding polynomials,
If the numbers
at the points
Root inclusion via Gerschgorin's circles
In the quotient ring (algebra) of residue classes modulo ƒ(X), the multiplication by X defines an endomorphism that has the zeros of ƒ(X) as eigenvalues with the corresponding multiplicities. Choosing a basis, the multiplication operator is represented by its coefficient matrix A, the companion matrix of ƒ(X) for this basis.
Since every polynomial can be reduced modulo ƒ(X) to a polynomial of degree n − 1 or lower, the space of residue classes can be identified with the space of polynomials of degree bounded by n − 1. A problem specific basis can be taken from Lagrange interpolation as the set of n polynomials
where
For the multiplication operator applied to the basis polynomials one obtains from the Lagrange interpolation
where
The companion matrix of ƒ(X) is therefore
From the transposed matrix case of the Gershgorin circle theorem it follows that all eigenvalues of A, that is, all roots of ƒ(X), are contained in the union of the disks
Here one has
Every conjugate matrix
Convergence results
The connection between the Taylor series expansion and Newton's method suggests that the distance from
For the conclusion of linear convergence there is a more specific result (see ref. Petkovic et al. 1995). If the initial vector
then this inequality also holds for all iterates, all inclusion disks
each containing exactly one zero of ƒ.