Samiksha Jaiswal (Editor)

History of Lorentz transformations

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
History of Lorentz transformations

The Lorentz transformations relate the space-time coordinates, which specify the position x, y, z and time t of an event, relative to a particular inertial frame of reference (the "rest system"), and the coordinates of the same event relative to another coordinate system moving in the positive x-direction at a constant speed v, relative to the rest system. It was devised as a theoretical transformation which makes the velocity of light invariant between different inertial frames. The coordinates of the event in this "moving system" are denoted x′, y′, z′ and t′. The rest system was sometimes identified with the luminiferous aether, the postulated medium for the propagation of light, and the moving system was commonly identified with the earth as it moved through this medium. Early approximations of the transformation were published by Voigt (1887) and Lorentz (1895). They were completed by Larmor (1897, 1900) and Lorentz (1899, 1904) and were brought into their modern form by Poincaré (1905), who gave the transformation the name of Lorentz. Eventually, Einstein (1905) showed in his development of special relativity that the transformations follow from the principle of relativity and the constant light speed alone, without requiring a mechanical aether, and are changing the traditional concepts of space and time. Subsequently, Minkowski used them to argue that space and time are inseparably connected as spacetime. Important contributions to the mathematical understanding of the Lorentz transformation were also made by other authors such as Vladimir Varićak (1910) and Vladimir Ignatowski (1910).

Contents

The Lorentz transformation has the form

x = γ ( x v t ) , y = y , z = z , t = γ ( t x v c 2 ) ,

v being the relative velocity of the two reference frames, and c the speed of light, and the Lorentz factor,

γ = 1 1 v 2 / c 2

In this article the historical notations are placed on the left, and modern notations on the right.

Sphere geometry in the 19th century

One of the defining properties of the Lorentz transformation is its group structure which leaves the expression x 2 + y 2 + z 2 c 2 t 2 invariant. So a spherical wave in one frame remains spherical in another one, which is often used to derive the Lorentz transformation. However, long before experiments and physical theories made the introduction of the Lorentz transformation necessary, transformation groups and sphere geometries transforming sphere into spheres have been discussed, such as the Transformation by reciprocal radii within Möbius geometry, and the Transformation by reciprocal directions within Laguerre geometry. Both can be seen as special cases of Lie sphere geometry. The connections of these transformations to Maxwell's equations and the laws of physics were discovered, however, only after 1905 when the Lorentz transformation was already derived in a different way by physicists.

In several papers between 1847 and 1850 it was shown by Joseph Liouville that the relation λ ( δ x 2 + δ y 2 + δ z 2 ) is invariant under the group of conformal transformations or the "Transformation by reciprocal radii" which transforms spheres into spheres. This theorem was extended to all dimensions by Sophus Lie (1871) so that λ ( δ x 1 2 + + δ x n 2 ) is invariant too. In 1909, Harry Bateman and Ebenezer Cunningham showed that not only the quadratic form but also Maxwells equations are covariant with respect to conformal transformation, irrespective of the choice of λ . This variant of conformal transformations were called spherical wave transformations by him. However, this covariance is restricted to certain areas such as electrodynamics, whereas the totality of natural laws in inertial frames is covariant under the Lorentz group.

Albert Ribaucour (1870) and in particular Edmond Laguerre (1880-1885) employed another variant, namely the "transformation by reciprocal directions" or „Laguerre inversion/transformation“ which transforms spheres into spheres and planes into planes. Laguerre explicitly wrote down the corresponding transformation formulas in 1882, with Gaston Darboux (1887) presenting them in respect to coordinates x , y , z , R (R being the radius):

x = x , z = 1 + k 2 1 k 2 z 2 k R 1 k 2 , y = y , R = 2 k z 1 k 2 1 + k 2 1 k 2 R ,

producing the following relation:

x 2 + y 2 + z 2 R 2 = x 2 + y 2 + z 2 R 2 .

Several authors showed the close relation to the Lorentz transformation (see Laguerre inversion and Lorentz transformation) – by setting R = t , c = 1 , and v = 2 k / ( 1 + k 2 ) , it follows

1 k 2 1 + k 2 = 1 v 2 = 1 γ , 2 k 1 k 2 = v γ ,

thus the above transformation becomes similar to a Lorentz transformation with z as direction of motion, except that the sign of t is reversed from t v z to v z t :

x = x , y = y , z = γ ( z v t ) , t = γ ( v z t )

Furthermore, the group isomorphism between the Laguerre group and Lorentz group was pointed out by Élie Cartan, Henri Poincaré and others (see Laguerre group isomorphic to Lorentz group).

Voigt (1887)

Woldemar Voigt (1887) developed a transformation in connection with the Doppler effect and an incompressible medium, being in modern notation:

ξ 1 = x 1 ϰ t η 1 = y 1 q ζ 1 = z 1 q τ = t ϰ x 1 ω 2 q = 1 ϰ 2 ω 2 x = x v t y = y γ z = z γ t = t v x c 2 1 γ = 1 v 2 c 2 .

If the right-hand sides of his equations are multiplied by γ they are the modern Lorentz transformation. In Voigt's theory the speed of light is invariant, but his transformations mix up a relativistic boost together with a rescaling of space-time. Optical phenomena in free space are scale, conformal (using the factor λ discussed above), and Lorentz invariant, so the combination is invariant too. For instance, Lorentz transformations can be extended by using l = λ :

x = γ l ( x v t ) , y = l y , z = l z , t = γ l ( t x v c 2 ) .

l = 1 / γ gives the Voigt transformation, l = 1 the Lorentz transformation. But scale transformations are not a symmetry of all the laws of nature, only of electromagnetism, so these transformations cannot be used to formulate a principle of relativity in general. It was demonstrated by Poincaré and Einstein that one has to set l = 1 in order to make the above transformation symmetric and to form a group as required by the relativity principle, therefore the Lorentz transformation is the only viable choice.

Voigt sent his 1887 paper to Lorentz in 1908, and that was acknowledged in 1909:

In a paper „Über das Doppler'sche Princip“, published in 1887 (Gött. Nachrichten, p. 41) and which to my regret has escaped my notice all these years, Voigt has applied to equations of the form (6) (§ 3 of this book) [namely Δ Ψ 1 c 2 2 Ψ t 2 = 0 ] a transformation equivalent to the formulae (287) and (288) [namely x = γ l ( x v t ) ,   y = l y ,   z = l z ,   t = γ l ( t v c 2 x ) ]. The idea of the transformations used above (and in § 44) might therefore have been borrowed from Voigt and the proof that it does not alter the form of the equations for the free ether is contained in his paper.

Also Hermann Minkowski said in 1908 that the transformations which play the main role in the principle of relativity were first examined by Voigt in 1887. Voigt responded in the same paper by saying that his theory was based on an elastic theory of light, not an electromagnetic one. However, he concluded that some results were actually the same.

Heaviside (1888), Thomson (1889), Searle (1896)

In 1888, Oliver Heaviside investigated the properties of charges in motion according to Maxwell's electrodynamics. He calculated, among other things, anisotropies in the electric field of moving bodies represented by this formula:

E = ( q r r 2 ) ( 1 v 2 sin 2 θ c 2 ) 3 / 2 .

Consequently, Joseph John Thomson (1889) found a way to substantially simplify calculations concerning moving charges by using the following mathematical transformation (like other authors such as Lorentz or Larmor, also Thomson implicitly used the Galilean transformation z v t in his equation):

z = { 1 ω 2 v 2 } 1 2 z z = z v t = z γ .

Thereby, inhomogeneous electromagnetic wave equations are transformed into a Poisson equation. Eventually, George Frederick Charles Searle noted in (1896) that Heaviside's expression leads to a deformation of electric fields which he called "Heaviside-Ellipsoid" of axial ratio

α : 1 : 1 α = 1 u 2 v 2 1 γ : 1 : 1 1 γ 2 = 1 v 2 c 2 .

Lorentz (1892, 1895)

In order to explain the aberration of light and the result of the Fizeau experiment in accordance with Maxwell's equations, Lorentz in 1892 developed a model ("Lorentz ether theory") in which the aether is completely motionless, and the speed of light in the aether is constant in all directions. In order to calculate the optics of moving bodies, Lorentz introduced the following quantities to transform from the aether system into a moving system (it's unknown whether he was influenced by Voigt, Heaviside, and Thomson).

x = V V 2 p 2 x t = t ϵ V x ϵ = p V 2 p 2 x = γ x = γ ( x v t ) t = t γ 2 v x c 2 = γ 2 ( t v x c 2 ) γ v c = v c 2 v 2

where x* is the Galilean transformation x-vt. Except the additional γ in the time transformation, this is the complete Lorentz transformation. While t is the "true" time for observers resting in the aether, t is an auxiliary variable only for calculating processes for moving systems. It is also important that Lorentz and later also Larmor formulated this transformation in two steps. At first an implicit Galilean transformation, and later the expansion into the "fictitious" electromagnetic system with the aid of the Lorentz transformation. In order to explain the negative result of the Michelson–Morley experiment, he (1892b) introduced the additional hypothesis that also intermolecular forces are affected in a similar way and introduced length contraction in his theory (without proof as he admitted). The same hypothesis was already made by George FitzGerald in 1889 based on Heaviside's work. While length contraction was a real physical effect for Lorentz, he considered the time transformation only as a heuristic working hypothesis and a mathematical stipulation.

In 1895, Lorentz further elaborated on his theory and introduced the "theorem of corresponding states". This theorem states that a moving observer (relative to the ether) in his „fictitious“ field makes the same observations as a resting observers in his „real“ field for velocities to first order in v / c . Lorentz showed that the dimensions of electrostatic systems in the ether and a moving frame are connected by this transformation:

x = x 1 p 2 V 2 y = y z = z t = t x = x v t = x γ y = y z = z t = t

For solving optical problems Lorentz used the following transformation, in which the modified time variable was called "local time" (German: Ortszeit) by him:

x = x p x t y = y p y t z = z p z t t = t p x V 2 x p y V 2 y p z V 2 z x = x v x t y = y v y t z = z v z t t = t v x c 2 x v y c 2 y v z c 2 z

With this concept Lorentz could explain the Doppler effect, the aberration of light, and the Fizeau experiment.

Larmor (1897, 1900)

In 1897, Larmor extended the work of Lorentz and derived the following transformation

x 1 = x ϵ 1 2 y 1 = y z 1 = z t = t v x / c 2 d t 1 = d t ϵ 1 2 ϵ = ( 1 v 2 / c 2 ) 1 x 1 = γ x = γ ( x v t ) y 1 = y z 1 = z t = t v x c 2 d t 1 = d t γ γ 2 = 1 1 v 2 c 2

Larmor noted that if it is assumed that the constitution of molecules is electrical then the FitzGerald-Lorentz contraction is a consequence of this transformation. It's notable that Larmor was the first who recognized that some sort of time dilation is a consequence of this transformation as well, because individual electrons describe corresponding parts of their orbits in times shorter for the [rest] system in the ratio 1 / γ .

In 1900 he modified the above local time t by replacing the expression v / c 2 with ϵ v / c 2 , by which it becomes identical to the one given by Lorentz in 1892. He started with the following transformation

x = x v t y = y z = z t = t t = t ϵ v x / c 2 x = x v t y = y z = z t = t t = t γ 2 v x c 2 = γ 2 ( t v x c 2 )

This transformation is just the Galilean transformation for the x , y , z , coordinates but contains Lorentz’s "local time". Larmor knew that the Michelson–Morley experiment was accurate enough to detect an effect of motion depending on the factor v 2 / c 2 , and so he sought the transformations which were "accurate to second order" (as he put it). Thus he wrote the final transformations (where x = x v t ) as:

x 1 = ϵ 1 2 x y 1 = y z 1 = z d t 1 = ϵ 1 2 d t = ϵ 1 2 ( d t v c 2 ϵ d x ) t 1 = ϵ 1 2 t v c 2 ϵ 1 2 x x 1 = γ x = γ ( x v t ) y 1 = y = y z 1 = z = z d t 1 = d t γ = 1 γ ( d t γ 2 v d x c 2 ) = γ ( d t v d x c 2 ) t 1 = t γ γ v x c 2 = γ ( t v x c 2 )

by which he arrived at the complete Lorentz transformation. Larmor showed that Maxwell's equations were invariant under this two-step transformation, "to second order in v / c ", as he put it.

Larmor gave credit to Lorentz in two papers published in 1904, in which he used the term "Lorentz transformation" for Lorentz's first order transformations of coordinates and field configurations:

p. 583: [..] Lorentz's transformation for passing from the field of activity of a stationary electrodynamic material system to that of one moving with uniform velocity of translation through the aether.

p. 585: [..] the Lorentz transformation has shown us what is not so immediately obvious [..]
p. 622: [..] the transformation first developed by Lorentz: namely, each point in space is to have its own origin from which time is measured, its "local time" in Lorentz's phraseology, and then the values of the electric and magnetic vectors [..] at all points in the aether between the molecules in the system at rest, are the same as those of the vectors [..] at the corresponding points in the convected system at the same local times.

Lorentz (1899, 1904)

Also Lorentz extended his theorem of corresponding states in 1899. First he wrote a tranformation equivalent to the one from 1892 (again, x must be replaced by x v t ):

x = V V 2 p x 2 x y = y z = z t = t p x V 2 p x 2 x x = γ x = γ ( x v t ) y = y z = z t = t γ 2 v x c 2 = γ 2 ( t v x c 2 )

Then he introduced a factor ε of which he said he has no means of determining it, and modified his transformation as follows (where the above value of t has to be inserted):

x = ε k x y = ε y z = ε x t = k ε t k = V V 2 p x 2 x = x v t = ε γ x y = ε y z = ε z t = γ 2 ( t v x c 2 ) = γ ε t γ = 1 1 v 2 c 2

This is identical to the complete Lorentz transformation when solved for x and t and with ε = 1 . Like Larmor, Lorentz noticed in 1899 also some sort of time dilation effect in relation to the frequency of oscillating electrons "that in S the time of vibrations be k ε times as great as in S 0 ", where S 0 is the aether frame.

In 1904 he rewrote the equations in the following form by setting l = 1 / ε (again, x must be replaced by x v t ):

x = k l x y = l y z = l z t = l k t k l w c 2 x x = γ l x = γ l ( x v t ) y = l y z = l z t = l t γ γ l v x c 2 = γ l ( t v x c 2 )

Under the assumption that l = 1 when v = 0 , he demonstrated that l = 1 must be the case at all velocities, therefore length contraction can only arise in the line of motion. So by setting the factor l to unity, Lorentz's transformations now assumed the same form as Larmor's and are now completed. Unlike Larmor, who restricted himself to show the covariance of Maxwell's equations to second order, Lorentz tried to widen its covariance to all orders in v / c . He also derived the correct formulas for the velocity dependence of electromagnetic mass, and concluded that the transformation formulas must apply to all forces of nature, not only electrical ones. However, he didn't achieve full covariance of the transformation equations for charge density and velocity. When the 1904 paper was reprinted in 1913, Lorentz therefore added the following remark:

One will notice that in this work the transformation equations of Einstein’s Relativity Theory have not quite been attained. [..] On this circumstance depends the clumsiness of many of the further considerations in this work.

Local time

Neither Lorentz or Larmor gave a clear physical interpretation of the origin of local time. However, Henri Poincaré in 1900 commented on the origin of Lorentz’s “wonderful invention” of local time. He remarked that it arose when clocks in a moving reference frame are synchronised by exchanging signals which are assumed to travel with the same speed c in both directions, which lead to what is nowadays called relativity of simultaneity, although Poincaré's calculation does not involve length contraction or time dilation. In order to synchronise the clocks here on Earth (the x , t frame) a light signal from one clock (at the origin) is sent to another (at x ), and is sent back. It's supposed that the Earth is moving with speed v in the x -direction (= x -direction) in some rest system ( x , t ) (i.e. the luminiferous aether system for Lorentz and Larmor). The time of flight outwards is

δ t o = x ( c v )

and the time of flight back is

δ t b = x ( c + v ) .

The elapsed time on the clock when the signal is returned is δ t o + δ t b and the time t = ( δ t o + δ t b ) / 2 is ascribed to the moment when the light signal reached the distant clock. In the rest frame the time t = δ t o is ascribed to that same instant. Some algebra gives the relation between the different time coordinates ascribed to the moment of reflection. Thus

t = t γ 2 v x c 2

identical to Lorentz (1892). Poincaré gave the result t = t v x / c 2 , which is the form used by Lorentz in 1895. Poincaré dropped the factor γ 2 under the assumption that

v 2 c 2 1 .

Similar physical interpretations of local time were later given by Emil Cohn (1904) and Max Abraham (1905).

Lorentz transformation

On June 5, 1905 (published June 9) Poincaré simplified the equations which are algebraically equivalent to those of Larmor and Lorentz and gave them the modern form. Apparently Poincaré was unaware of Larmor's contributions, because he only mentioned Lorentz and therefore used for the first time the name "Lorentz transformation".

x = k l ( x + ε t ) y = l y z = l z t = k l ( t + ε x ) k = 1 1 ε 2 .

Poincaré set the speed of light to unity, pointed out the group characteristics of the transformation by setting l = 1 , and modified/corrected Lorentz's derivation of the equations of electrodynamics in some details in order to fully satisfy the principle of relativity, i.e. making them fully Lorentz covariant.

In July 1905 (published in January 1906) Poincaré showed in detail how the transformations and electrodynamic equations are a consequence of the principle of least action; he demonstrated in more detail the group characteristics of the transformation, which he called Lorentz group, and he showed that the combination x 2 + y 2 + z 2 c 2 t 2 is invariant. He noticed that the Lorentz transformation is merely a rotation in four-dimensional space about the origin by introducing c t 1 as a fourth imaginary coordinate, and he used an early form of four-vectors.

Einstein (1905)

On June 30, 1905 (published September 1905) Einstein published what is now called special relativity and gave a new derivation of the transformation, which was based only on the principle on relativity and the principle of the constancy of the speed of light. While Lorentz considered "local time" to be a mathematical stipulation device for explaining the Michelson-Morley experiment, Einstein showed that the coordinates given by the Lorentz transformation were in fact the inertial coordinates of relatively moving frames of reference. For quantities of first order in v/c this was also done by Poincaré in 1900, while Einstein derived the complete transformation by this method. Unlike Lorentz and Poincaré who still distinguished between real time in the aether and apparent time for moving observers, Einstein showed that the transformations concern the nature of space and time.

The notation for this transformation is identical to Poincaré's of 1905, except that Einstein didn't set the speed of light to unity:

τ = β ( t v V 2 x ) ξ = β ( x v t ) η = y ζ = z β = 1 1 ( v V ) 2

Minkowski (1907–1908)

The work on the principle of relativity by Lorentz, Einstein, Planck, together with Poincaré's four-dimensional approach, were further elaborated by Hermann Minkowski in 1907 and 1908. Minkowski particularly reformulated electrodynamics in a four-dimensional way (Minkowski spacetime). For instance, he wrote x , y , z , i t in the form x 1 , x 2 , x 3 , x 4 . By defining ψ as the angle of rotation around the z -axis, the Lorentz transformation assumes the form (with c = 1 ).

x 1 = x 1 x 2 = x 2 x 3 = x 3 cos i ψ + x 4 sin i ψ x 4 = x 3 sin i ψ + x 4 cos i ψ cos i ψ = 1 1 q 2

Even though Minkowski ordinarily used the imaginary number i ψ , he for once directly used the tangens hyperbolicus in the equation for velocity

i tan i ψ = e ψ e ψ e ψ + e ψ = q with ψ = 1 2 ln 1 + q 1 q .

Minkowski's expression can also by written as ψ = artanh ( q ) and was later called rapidity. As a graphical representation of the Lorentz transformation he also invented the Minkowski diagram, which became a standard tool in textbooks and research articles on relativity:

Varićak (1910)

Minkowski's rapidity in terms of real hyperbolic functions was systematically employed by Vladimir Varićak in several papers starting from 1910, who represented the equations of special relativity on the basis of hyperbolic geometry. For instance, by setting l = c t and v / c = tanh u with u as rapidity he wrote the Lorentz transformation as follows:

l = x sinh u + l cosh u , x = x cosh u l sinh u , y = y , z = z , cosh u = 1 1 ( v c ) 2

Subsequently, other authors such as E. T. Whittaker (1910) or Alfred Robb (1911, who coined the name rapidity) used similar expressions, which are still used in modern textbooks.

Ignatowski (1910)

While earlier derivations and formulations of the Lorentz transformation relied from the outset on optics, electrodynamics, or the invariance of the speed of light, Vladimir Ignatowski (1910) showed that it is possible to use the principle of relativity (and related group theoretical principles) alone, in order to derive the following transformation between two inertial frames:

x = p ( x v t ) y = y z = z t = p ( t n v x ) p = 1 1 n v 2

The variable n can be seen as a space-time constant whose value has to be determined by experiment or taken from a known physical law such as electrodynamics. For that purpose, Ignatowski used the above-mentioned Heaviside ellipsoid representing a contraction of electrostatic fields by x / γ in the direction of motion. It can be seen that this is only consistent with Ignatowski's transformation when n = 1 / c 2 , resulting in p = γ and the Lorentz transformation. With n = 0 , no length changes arise and the Galilean transformation follows. Ignatowski's method was further developed and improved by Philipp Frank and Hermann Rothe (1911, 1912), with various authors developing similar methods in subsequent years.

References

History of Lorentz transformations Wikipedia