Fermat's theorem on sums of two squares asserts that an odd prime number p can be expressed as
Contents
- Eulers proof by infinite descent
- Lagranges proof through quadratic forms
- Dedekinds two proofs using Gaussian integers
- Proof by Minkowskis Theorem
- Zagiers one sentence proof
- References
with integer x and y if and only if p is congruent to 1 (mod 4). The statement was announced by Girard in 1625, and again by Fermat in 1640, but neither supplied a proof.
The "only if" clause is easy: a perfect square is congruent to 0 or 1 modulo 4, hence a sum of two squares is congruent to 0, 1, or 2. An odd prime number is congruent to either 1 or 3 modulo 4, and the second possibility has just been ruled out. The first proof that such a representation exists was given by Leonhard Euler in 1747 and was complicated. Since then, many different proofs have been found. Among them, the proof using Minkowski's theorem about convex sets and Don Zagier's short proof based on involutions have appeared.
Euler's proof by infinite descent
Euler succeeded in proving Fermat's theorem on sums of two squares in 1749, when he was forty-two years old. He communicated this in a letter to Goldbach dated 12 April 1749. The proof relies on infinite descent, and is only briefly sketched in the letter. The full proof consists in five steps and is published in two papers. The first four steps are Propositions 1 to 4 of the first paper and do not correspond exactly to the four steps below. The fifth step below is from the second paper.
1. The product of two numbers, each of which is a sum of two squares, is itself a sum of two squares.
2. If a number which is a sum of two squares is divisible by a prime which is a sum of two squares, then the quotient is a sum of two squares. (This is Euler's first Proposition).
3. If a number which can be written as a sum of two squares is divisible by a number which is not a sum of two squares, then the quotient has a factor which is not a sum of two squares. (This is Euler's second Proposition).
4. If
5. Every prime of the form
Lagrange's proof through quadratic forms
Lagrange completed a proof in 1775 based on his general theory of integral quadratic forms. The following presentation incorporates a slight simplification of his argument, due to Gauss, which appears in article 182 of the Disquisitiones Arithmeticae.
An (integral binary) quadratic form is an expression of the form
The discriminant of the quadratic form is defined to be
Two forms
with
Lagrange proved that all positive definite forms of discriminant −4 are equivalent. Thus, to prove Fermat's theorem it is enough to find any positive definite form of discriminant −4 that represents
where the first coefficient a = p was chosen so that the form represents p by setting x = 1, and y = 0, the coefficient b = 2m is an arbitrary even number (as it must be, to get an even discriminant), and finally
As said, it suffices to find a root m of the polynomial
Dedekind's two proofs using Gaussian integers
Richard Dedekind gave at least two proofs of Fermat's theorem on sums of two squares, both using the arithmetical properties of the Gaussian integers, which are numbers of the form a + bi, where a and b are integers, and i is the square root of −1. One appears in section 27 of his exposition of ideals published in 1877; the second appeared in Supplement XI to Peter Gustav Lejeune Dirichlet's Vorlesungen über Zahlentheorie, and was published in 1894.
1. First proof. If
since the automorphism fixes the elements of Z/(p). In the current case,
2. Second proof. This proof builds on Lagrange's result that if
Proof by Minkowski's Theorem
For
The area of the fundamental parallelogram of the lattice is
Zagier's "one-sentence proof"
If p = 4k + 1 is prime, then the set S = {(x, y, z) ∈ N3: x2 + 4yz = p} (here the set N of all natural numbers can be taken to include 0 or to exclude 0, and in both cases, x, y and z must be positive for any (x, y, z) ∈ S, as p is an odd prime) is finite and has two involutions: an obvious one (x, y, z) → (x, z, y), whose fixed points, (x, y, y), correspond to representations of p as a sum of two squares, and a more complicated one,
which has exactly one fixed point, (1, 1, k). The cardinality of S has the same parity as the number of fixed points of an involution on that set. Thus, from the second involution we know that the cardinality of S is odd and therefore the number of fixed points for the first involution cannot be zero, proving the existence of fixed points for the first involution and consequently that p is a sum of two squares.
This proof, due to Zagier, is a simplification of an earlier proof by Heath-Brown, which in turn was inspired by a proof of Liouville. The technique of the proof is a combinatorial analogue of the topological principle that the Euler characteristics of a topological space with an involution and of its fixed point set have the same parity and is reminiscent of the use of sign-reversing involutions in the proofs of combinatorial bijections.