§ 4. The Lorentz transformation
Our purpose is now to obtain the formula of transformation from one inertial reference system to another, that is, a formula by means of which, knowing the coordinates , of a certain event in the system, we can find the coordinates of the same event in another inertial system .
In classical mechanics this question is resolved very simply. Because of the absolute nature of time we there have ; if, furthermore, the coordinate axes are chosen as usual (axes coincident, axes parallel to , motion along ) then the coordinates clearly are equal to , while the coordinates and differ by the distance traversed by one system relative to the other. If the time origin is chosen as the moment when the two coordinate systems coincide, and if the velocity of the system relative to is , then this distance is . Thus
† It is assumed, of course, that the points and and the curves joining them are such that all elements along the curves are timelike.
This property of the integral is connected with the pseudo-euclidean character of the four-dimensional geometry. In euclidean space the integral would, of course, be a minimum along the straight line.
This formula is called the Galileo transformation. It is easy to verify that this transformation, as was to be expected, does not satisfy the requirements of the theory of relativity; it does not leave the interval between events invariant.
We shall obtain the relativistic transformation precisely as a consequence of the requirement that it leaves the interval between events invariant.
As we saw in § 2, the interval between events can be looked on as the distance between the corresponding pair of world points in a four-dimensional system of coordinates. Consequently we may say that the required transformation must leave unchanged all distances in the four-dimensional , space. But such transformations consist only of parallel displacements, and rotations of the coordinate system. Of these the displacement of the coordinate system parallel to itself is of no interest, since it leads only to a shift in the origin of the space coordinates and a change in the time reference point. Thus the required transformation must be expressible mathematically as a rotation of the four-dimensional , coordinate system.
Every rotation in the four-dimensional space can be resolved into six rotations, in the planes (just as every rotation in ordinary space can be resolved into three rotations in the planes and ). The first three of these rotations transform only the space coordinates; they correspond to the usual space rotations.
Let us consider a rotation in the plane; under this, the and coordinates do not change. In particular, this transformation must leave unchanged the difference , the square of the “distance” of the point from the origin. The relation between the old and the new coordinates is given in most general form by the formulas:
where is the “angle of rotation”; a simple check shows that in fact . Formula (4.2) differs from the usual formulas for transformation under rotation of the coordinate axes in having hyperbolic functions in place of trigonometric functions. This is the difference between pseudo-euclidean and euclidean geometry.
We try to find the formula of transformation from an inertial reference frame to a system moving relative to with velocity along the axis. In this case clearly only the coordinate and the time are subject to change. Therefore this transformation must have the form (4.2). Now it remains only to determine the angle , which can depend only on the relative velocity .†
Let us consider the motion, in the system, of the origin of the system. Then and formulas (4.2) take the form:
or dividing one by the other,
But is clearly the velocity of the system relative to . So
† Note that to avoid confusion we shall always use to signify the constant relative velocity of two inertial systems, and for the velocity of a moving particle, not necessarily constant.
From this
Substituting in (4.2), we find:
This is the required transformation formula. It is called the Lorentz transformation, and is of fundamental importance for what follows.
The inverse formulas, expressing in terms of , are most easily obtained by changing to (since the system moves with velocity relative to the system). The same formulas can be obtained directly by solving equations (4.3) for .
It is easy to see from (4.3) that on making the transition to the limit and classical mechanics, the formula for the Lorentz transformation actually goes over into the Galileo transformation.
For in formula (4.3) the coordinates are imaginary; this corresponds to the fact that motion with a velocity greater than the velocity of light is impossible. Moreover, one cannot use a reference system moving with the velocity of light—in that case the denominators in (4.3) would go to zero.
For velocities small compared with the velocity of light, we can use in place of (4.3) the approximate formulas:
Suppose there is a rod at rest in the system, parallel to the axis. Let its length, measured in this system, be ( and are the coordinates of the two ends of the rod in the system). We now determine the length of this rod as measured in the system. To do this we must find the coordinates of the two ends of the rod ( and ) in this system at one and the same time . From (4.3) we find:
The length of the rod in the system is ; subtracting from , we find
The proper length of a rod is its length in a reference system in which it is at rest. Let us denote it by , and the length of the rod in any other reference frame by . Then
Thus a rod has its greatest length in the reference system in which it is at rest. Its length in a system in which it moves with velocity is decreased by the factor . This result of the theory of relativity is called the Lorentz contraction.
Since the transverse dimensions do not change because of its motion, the volume of a body decreases according to the similar formula
where is the proper volume of the body.
From the Lorentz transformation we can obtain a new the results already known to us concerning the proper time (§ 3). Suppose a clock to be at rest in the system. We take two events occurring at one and the same point in space in the system. The time between these events in the system is . Now we find the time which elapses between these two events in the system. From (4.3), we have
or, subtracting one from the other,
in complete agreement with (3.1).
Finally we mention another general property of Lorentz transformations which distinguishes them from Galilean transformations. The latter have the general property of commutativity, i.e. the combined result of two successive Galilean transformations (with different velocities and ) does not depend on the order in which the transformations are performed. On the other hand, the result of two successive Lorentz transformations does depend, in general, on their order. This is already apparent purely mathematically from our formal description of these transformations as rotations of the four-dimensional coordinate system: we know that the result of two rotations (about different axes) depends on the order in which they are carried out. The sole exception is the case of transformations with parallel vectors and (which are equivalent to two rotations of the four-dimensional coordinate system about the same axis).