§ 4. The Lorentz transformation

Our purpose is now to obtain the formula of transformation from one inertial reference system to another, that is, a formula by means of which, knowing the coordinates x,y,z,tx, y, z, t , of a certain event in the KK system, we can find the coordinates x,y,z,tx', y', z', t' of the same event in another inertial system KK' .

In classical mechanics this question is resolved very simply. Because of the absolute nature of time we there have t=tt = t' ; if, furthermore, the coordinate axes are chosen as usual (axes X,XX, X' coincident, Y,ZY, Z axes parallel to Y,ZY', Z' , motion along X,XX, X' ) then the coordinates y,zy, z clearly are equal to y,zy', z' , while the coordinates xx and xx' differ by the distance traversed by one system relative to the other. If the time origin is chosen as the moment when the two coordinate systems coincide, and if the velocity of the KK' system relative to KK is VV , then this distance is VtVt . Thus

† It is assumed, of course, that the points aa and bb and the curves joining them are such that all elements dsds along the curves are timelike.

This property of the integral is connected with the pseudo-euclidean character of the four-dimensional geometry. In euclidean space the integral would, of course, be a minimum along the straight line.

x=x+Vt,y=y,z=z,t=t.(4.1)x = x' + Vt, \quad y = y', \quad z = z', \quad t = t'. \quad \tag{4.1}

This formula is called the Galileo transformation. It is easy to verify that this transformation, as was to be expected, does not satisfy the requirements of the theory of relativity; it does not leave the interval between events invariant.

We shall obtain the relativistic transformation precisely as a consequence of the requirement that it leaves the interval between events invariant.

As we saw in § 2, the interval between events can be looked on as the distance between the corresponding pair of world points in a four-dimensional system of coordinates. Consequently we may say that the required transformation must leave unchanged all distances in the four-dimensional x,y,z,ctx, y, z, ct , space. But such transformations consist only of parallel displacements, and rotations of the coordinate system. Of these the displacement of the coordinate system parallel to itself is of no interest, since it leads only to a shift in the origin of the space coordinates and a change in the time reference point. Thus the required transformation must be expressible mathematically as a rotation of the four-dimensional x,y,z,ctx, y, z, ct , coordinate system.

Every rotation in the four-dimensional space can be resolved into six rotations, in the planes xy,zy,xz,tx,ty,tzxy, zy, xz, tx, ty, tz (just as every rotation in ordinary space can be resolved into three rotations in the planes xy,zyxy, zy and xzxz ). The first three of these rotations transform only the space coordinates; they correspond to the usual space rotations.

Let us consider a rotation in the txtx plane; under this, the yy and zz coordinates do not change. In particular, this transformation must leave unchanged the difference (ct)2x2(ct)^2 - x^2 , the square of the “distance” of the point (ct,x)(ct, x) from the origin. The relation between the old and the new coordinates is given in most general form by the formulas:

x=xcoshψ+ctsinhψ,ct=xsinhψ+ctcoshψ,(4.2)x = x' \cosh \psi + ct' \sinh \psi, \quad ct = x' \sinh \psi + ct' \cosh \psi, \quad \tag{4.2}

where ψ\psi is the “angle of rotation”; a simple check shows that in fact c2t2x2=c2t2x2c^2 t^2 - x^2 = c^2 t'^2 - x'^2 . Formula (4.2) differs from the usual formulas for transformation under rotation of the coordinate axes in having hyperbolic functions in place of trigonometric functions. This is the difference between pseudo-euclidean and euclidean geometry.

We try to find the formula of transformation from an inertial reference frame KK to a system KK' moving relative to KK with velocity VV along the xx axis. In this case clearly only the coordinate xx and the time tt are subject to change. Therefore this transformation must have the form (4.2). Now it remains only to determine the angle ψ\psi , which can depend only on the relative velocity VV .†

Let us consider the motion, in the KK system, of the origin of the KK' system. Then x=0x' = 0 and formulas (4.2) take the form:

x=ctsinhψ,ct=ctcoshψ,x = ct' \sinh \psi, \quad ct = ct' \cosh \psi,

or dividing one by the other,

xct=tanhψ.\frac{x}{ct} = \tanh \psi.

But x/tx/t is clearly the velocity VV of the KK' system relative to KK . So

† Note that to avoid confusion we shall always use VV to signify the constant relative velocity of two inertial systems, and vv for the velocity of a moving particle, not necessarily constant.

tanhψ=Vc.\tanh \psi = \frac{V}{c}.

From this

sinhψ=V/c1V2/c2,coshψ=11V2/c2.\sinh \psi = \frac{V/c}{\sqrt{1 - V^2/c^2}}, \quad \cosh \psi = \frac{1}{\sqrt{1 - V^2/c^2}}.

Substituting in (4.2), we find:

x=x+Vt1V2/c2,y=y,z=z,t=t+Vc2x1V2/c2.(4.3)x = \frac{x' + Vt'}{\sqrt{1 - V^2/c^2}}, \quad y = y', \quad z = z', \quad t = \frac{t' + \frac{V}{c^2}x'}{\sqrt{1 - V^2/c^2}}. \quad \tag{4.3}

This is the required transformation formula. It is called the Lorentz transformation, and is of fundamental importance for what follows.

The inverse formulas, expressing x,y,z,tx', y', z', t' in terms of x,y,z,tx, y, z, t , are most easily obtained by changing VV to V-V (since the KK system moves with velocity V-V relative to the KK' system). The same formulas can be obtained directly by solving equations (4.3) for x,y,z,tx', y', z', t' .

It is easy to see from (4.3) that on making the transition to the limit cc \to \infty and classical mechanics, the formula for the Lorentz transformation actually goes over into the Galileo transformation.

For V>cV > c in formula (4.3) the coordinates x,tx, t are imaginary; this corresponds to the fact that motion with a velocity greater than the velocity of light is impossible. Moreover, one cannot use a reference system moving with the velocity of light—in that case the denominators in (4.3) would go to zero.

For velocities VV small compared with the velocity of light, we can use in place of (4.3) the approximate formulas:

x=x+Vt,y=y,z=z,t=t+Vc2x.(4.4)x = x' + Vt', \quad y = y', \quad z = z', \quad t = t' + \frac{V}{c^2}x'. \quad \tag{4.4}

Suppose there is a rod at rest in the KK system, parallel to the XX axis. Let its length, measured in this system, be Δx=x2x1\Delta x = x_2 - x_1 ( x2x_2 and x1x_1 are the coordinates of the two ends of the rod in the KK system). We now determine the length of this rod as measured in the KK' system. To do this we must find the coordinates of the two ends of the rod ( x2x_2' and x1x_1' ) in this system at one and the same time tt' . From (4.3) we find:

x1=x1+Vt1V2/c2,x2=x2+Vt1V2/c2.x_1 = \frac{x_1' + Vt'}{\sqrt{1 - V^2/c^2}}, \quad x_2 = \frac{x_2' + Vt'}{1 - V^2/c^2}.

The length of the rod in the KK' system is Δx=x2x1\Delta x' = x_2' - x_1' ; subtracting x1x_1 from x2x_2 , we find

Δx=Δx1V2/c2.\Delta x = \frac{\Delta x'}{\sqrt{1 - V^2/c^2}}.

The proper length of a rod is its length in a reference system in which it is at rest. Let us denote it by l0=Δxl_0 = \Delta x , and the length of the rod in any other reference frame KK' by ll . Then

l=l01V2c2.(4.5)l = l_0 \sqrt{1 - \frac{V^2}{c^2}}. \quad \tag{4.5}

Thus a rod has its greatest length in the reference system in which it is at rest. Its length in a system in which it moves with velocity VV is decreased by the factor 1V2/c2\sqrt{1 - V^2/c^2} . This result of the theory of relativity is called the Lorentz contraction.

Since the transverse dimensions do not change because of its motion, the volume V\mathcal{V} of a body decreases according to the similar formula

V=V01V2c2,(4.6)\mathcal{V} = \mathcal{V}_0 \sqrt{1 - \frac{V^2}{c^2}}, \quad \tag{4.6}

where V0\mathcal{V}_0 is the proper volume of the body.

From the Lorentz transformation we can obtain a new the results already known to us concerning the proper time (§ 3). Suppose a clock to be at rest in the KK' system. We take two events occurring at one and the same point x,y,zx', y', z' in space in the KK' system. The time between these events in the KK' system is Δt=t2t1\Delta t' = t_2' - t_1' . Now we find the time Δt\Delta t which elapses between these two events in the KK system. From (4.3), we have

t1=t1+Vc2x1V2c2,t2=t2+Vc2x1V2c2,t_1 = \frac{t_1' + \frac{V}{c^2}x'}{\sqrt{1 - \frac{V^2}{c^2}}}, \quad t_2 = \frac{t_2' + \frac{V}{c^2}x'}{\sqrt{1 - \frac{V^2}{c^2}}},

or, subtracting one from the other,

t2t1=Δt=Δt1V2c2,t_2 - t_1 = \Delta t = \frac{\Delta t'}{\sqrt{1 - \frac{V^2}{c^2}}},

in complete agreement with (3.1).

Finally we mention another general property of Lorentz transformations which distinguishes them from Galilean transformations. The latter have the general property of commutativity, i.e. the combined result of two successive Galilean transformations (with different velocities V1V_1 and V2V_2 ) does not depend on the order in which the transformations are performed. On the other hand, the result of two successive Lorentz transformations does depend, in general, on their order. This is already apparent purely mathematically from our formal description of these transformations as rotations of the four-dimensional coordinate system: we know that the result of two rotations (about different axes) depends on the order in which they are carried out. The sole exception is the case of transformations with parallel vectors V1V_1 and V2V_2 (which are equivalent to two rotations of the four-dimensional coordinate system about the same axis).