In mathematics, Budan's theorem, named for François Budan de Boislaurent, is an early theorem for computing an upper bound on the number of real roots a polynomial has inside an open interval by counting the number of sign variations or sign changes in the sequences of coefficients.
Contents
- Sign variation
- Budans theorem
- Statement of Budans theorem
- Examples of Budans theorem
- Fouriers theorem
- Fouriers sequence
- Example of Fouriers sequence
- Statement of Fouriers theorem
- Example of Fouriers theorem
- Historical background
- Early applications of Budans theorem
- Disappearance of Budans theorem
- Comeback of Budans theorem
- Equivalence between the theorems by Budan and Fourier
- The most significant application of Budans theorem
- Vincents theorem 1834 and 1836
- Vincents implementation of his own theorem
- Uspenskys implementation of Vincents theorem
- References
Since 1836, the statement of Budan's theorem has been replaced in the literature by the statement of an equivalent theorem by Joseph Fourier, and the latter has been referred to under various names, including Budan's. Budan's original theorem forms the basis of the fastest known method for the isolation of the real roots of polynomials.
Sign variation
Let- If
r = l + 1 the numbersc l c r - If
r ≥ l + 2 the numbersc l + 1 , … , c r − 1 c l c r
Budan's theorem
Budan's theorem is equivalent to Fourier's theorem. Though Budan's formulation preceded Fourier's, the name Fourier has usually been associated with it.
Statement of Budan's theorem
Given an equation in
- The polynomial
p ( x + l ) cannot have fewer sign variations than those ofp ( x + r ) . In shortv l ≥ v r - The number
ρ of the real roots of the equationp ( x ) = 0 located in the open interval( l , r ) can never be more than the number of sign variations lost in passing from the transformed polynomialp ( x + l ) to the transformed polynomialp ( x + r ) . In short,ρ ≤ v l − v r - When the number
ρ of the real roots of the equationp ( x ) = 0 located in the open interval( l , r ) is strictly less than the number of the sign variations lost in passing from the transformed polynomialp ( x + l ) to the transformed polynomialp ( x + r ) then the difference is always an even number. In short,ρ = v l − v r − 2 λ whereλ ∈Z +
Making use of the substitutions
- If there is no sign variation loss, then there are no real roots in the interval
( l , r ) . - If there is one sign variation loss, then there is exactly one real root in the interval
( l , r ) . The inverse statement does not hold in this case.
Examples of Budan's theorem
1. Given the polynomial
Thus, from Budan's theorem
2. Given the same polynomial
By Budan's theorem
The last example indicates the main use of Budan's theorem as a "no roots test".
Fourier's theorem
The statement of Fourier's theorem (for polynomials) which also appears as Fourier–Budan theorem or as the Budan–Fourier theorem or just as Budan's theorem can be found in almost all texts and articles on the subject.
Fourier's sequence
Given an equation in
Example of Fourier's sequence
The Fourier sequence of the polynomial
Statement of Fourier's theorem
Given the polynomial equation
- The sequence
F seq ( l ) cannot present fewer sign variations than the sequenceF seq ( r ) . In short,v l ≥ v r - The number
ρ of the real roots of the equationp ( x ) = 0 located in the open interval( l , r ) can never be more than the number of sign variations lost in passing from the sequenceF seq ( l ) to the sequenceF seq ( r ) . In short,ρ ≤ v l − v r - When the number
ρ of the real roots of the equationp ( x ) = 0 located in the open interval( l , r ) is strictly less than the number of the sign variations lost in passing from the sequenceF seq ( l ) to the sequenceF seq ( r ) then the difference is always an even number. In short,ρ = v l − v r − 2 λ whereλ ∈ Z +
Example of Fourier's theorem
Given the previously mentioned polynomial
Thus, from Fourier's theorem
Historical background
In the beginning of the 19th century, F. D. Budan and J. B. J. Fourier presented two different (but equivalent) theorems which enable us to determine the maximum possible number of real roots that an equation has within a given interval.
Budan's formulation is rarely cited. Instead, Fourier's formulation is usually used, and named the Fourier, Fourier–Budan, Budan–Fourier, or even Budan's theorem. The actual statement of Budan's theorem appeared in 1807 in the memoir "Nouvelle méthode pour la résolution des équations numériques", whereas Fourier's theorem was first published in 1820 in the "Bulletin des Sciences, par la Société Philomatique de Paris". Due to the importance of these two theorems, there was a great controversy regarding priority rights.
Early applications of Budan's theorem
In "Nouvelle méthode pour la résolution des équations numériques", Budan himself used his theorem to compute the roots of any polynomial equation by calculating the decimal digits of the roots. More precisely, Budan used his theorem as a "no roots test", which can be stated as follows: if the polynomials
Furthermore, in his book, p. 37, Budan presents, independently of his theorem, a "0_1 roots test", that is a criterion for determining whether a polynomial has any roots in the interval (0,1). This test can be stated as follows:
Perform on
and
This test (which is a special case of the more general Alesina–Galuzzi "a_b roots test") was subsequently used by Uspensky in the 20th century. Uspensky was the one who kept Vincent's theorem alive carrying the torch (so to speak) from Serret.,)
In 1831, Bourdon, combined Budan's theorem and Lagrange's continued fraction method for approximating real roots of polynomials and, thus, gave a preview of Vincent's method, without actually giving credit to him. As Vincent mentions in the very first sentence of his 1834 papers, and 1836 Bourdon used (in his book) a joint presentation of theirs.
Disappearance of Budan's theorem
Budan's theorem forms the basis for Vincent's theorem and Vincent's (exponential) method for the isolation of the real roots of polynomials. Therefore, there is no wonder that Vincent in both of his papers of 1834 and 1836 states Budan's theorem and contrasts it with the one by Fourier. Vincent was the last author in the 19th century to state Budan's theorem in its original form.
Despite the fact that Budan's theorem was of such great importance, the appearance of Sturm's theorem in 1827 gave it (and Vincent's theorem) the death blow. Sturm's theorem solved the real root isolation problem, by defining the precise number of real roots a polynomial has in a real open interval (a, b); moreover, Sturm himself, p. 108, acknowledges the great influence Fourier's theorem had on him: « C'est en m'appuyant sur les principes qu'il a posés, et en imitant ses démonstrations, que j'ai trouvé les nouveaux théorèmes que je vais énoncer. » which translates to «It is by relying upon the principles he has laid out and by imitating his proofs that I have found the new theorems which I am about to present.» Because of the above, the theorems by Fourier and Sturm appear in almost all the books on the theory of equations and Sturm's method for computing the real roots of polynomials has been the only one widely known and used ever since – up to about 1980, when it was replaced (in almost all computer algebra systems) by methods derived from Vincent's theorem, the fastest one being the Vincent–Akritas–Strzeboński (VAS) method.
Consequently, Budan's theorem (but not his name) was pushed into oblivion. In Serret's book there is section 121 (p. 266) on Budan's theorem but the statement is the one due to Fourier, because, as the author explains in the footnote of p. 267, the two theorems are equivalent and Budan had clear priority. To his credit, Serret included in his Algebra, pp 363–368, Vincent's theorem along with its proof and directed all interested readers to Vincent's papers for examples on how it is used. Serret was the last author to mention Vincent's theorem in the 19th century.
Comeback of Budan's theorem
Budan's theorem reappeared, after almost 150 years, in Akritas' Ph.D. Thesis "Vincent's Theorem in Algebraic Manipulation", North Carolina State University, USA, 1978, and in several publications that resulted from that dissertation.
Equivalence between the theorems by Budan and Fourier
Budan's theorem is equivalent to the one by Fourier. This equivalence is obvious from the fact that, given the polynomial
As Alesina and Galuzzi point out in Footnote 9, p. 222 of their paper, the controversy over priority rights of Budan or Fourier is rather pointless from a modern point of view. The two authors think that Budan has an "amazingly modern understanding of the relevance of reducing the algorithm (his own word) to translate a polynomial by
Despite their equivalence, the two theorems are quite distinct concerning the impact they had on the isolation of the real roots of polynomials with rational coefficients. To wit:
The most significant application of Budan's theorem
Vincent's (exponential) method for the isolation of the real roots of polynomials (which is based on Vincent's theorem of 1834 and 1836) is the most significant application of Budan's theorem. Moreover, it is the most representative example of the importance of the statement of Budan's theorem. As explained below, knowing the statement of Fourier's theorem did not help Uspensky realize that there are no roots of
Vincent's theorem (1834 and 1836)
If in a polynomial equation with rational coefficients and without multiple roots, one makes successive transformations of the form
where a, b, and c are any positive numbers greater than or equal to one, then after a number of such transformations, the resulting transformed equation either has zero sign variations or it has a single sign variation. In the first case there is no root, whereas in the second case there is a single positive real root. Furthermore, the corresponding root of the proposed equation is approximated by the finite continued fraction:
Finally, if infinitely many numbers satisfying this property can be found, then the root is represented by the (infinite) corresponding continuous fraction.
The above statement is an exact translation of the theorem found in Vincent's original papers; for a clearer understanding see the remarks in the Wikipedia article Vincent's theorem
Vincent's implementation of his own theorem
Vincent uses Budan's theorem exclusively as a "no roots test" to locate where the roots lie on the x-axis (to compute the quantities
See the corresponding diagram where the root lies in the interval
Uspensky's implementation of Vincent's theorem
According to Alexei Uteshev of St. Petersburg University, Russia, Uspensky came upon the statement (and proof) of Vincent's theorem in the 20th century in Serret's Algebra, pp 363–368, which means that he was not aware of the statement of Budan's theorem (because Serret included in his book Fourier's theorem). Moreover, this means that Uspensky never saw Vincent's papers of 1834 and 1836, where Budan's theorem is stated and Vincent's method is explained with several examples (because Serret directed all interested readers to Vincent's papers for examples on how the theorem is used). Therefore, in the preface of his book that came out in 1949, Uspensky erroneously claimed that, based on Vincent's theorem, he had discovered a method for isolating the real roots "much superior in practice to that based on Sturm's Theorem". Uspensky's statement is erroneous because, since he is not using Budan's theorem, he is isolating the real roots doing twice the amount of work done by Vincent (see, pp. 127–137).
Uspensky does not know Budan's theorem and, hence, he cannot use it as a "no roots test". So, for him it does not suffice that
Uspensky's transformations are not the ones described in Vincent's theorem, and consequently, his transformations take twice as much computation time as the ones needed for Vincent's method.