§ 6. Four-vectors
The coordinates of an event can be considered as the components of a four-dimensional radius vector (or, for short, a four-radius vector) in a four-dimensional space. We shall denote its components by , where the index takes on the values 0, 1, 2, 3, and
The square of the “length” of the radius four-vector is given by
It does not change under any rotations of the four-dimensional coordinate system, in particular under Lorentz transformations.
In general a set of four quantities which transform like the components of the radius four-vector under transformations of the four-dimensional coordinate system is called a four-dimensional vector (four-vector) . Under Lorentz transformations,
The square magnitude of any four-vector is defined analogously to the square of the radius four-vector:
For convenience of notation, we introduce two “types” of components of four-vectors, denoting them by the symbols and , with superscripts and subscripts. These are related by
The square of the four-vector then appears in the form
Such sums are customarily written simply as , omitting the summation sign. One agrees that one sums over any repeated index, and omits the summation sign. Of the pair of indices, one must be a superscript and the other a subscript. This convention for summation over “dummy” indices is very convenient and considerably simplifies the writing of formulas.
We shall use Latin letters , for four-dimensional indices, taking on the values 0, 1, 2, 3.
In analogy to the square of a four-vector, one forms the scalar product of two different four-vectors:
It is clear that this can be written either as or —the result is the same. In general one can switch upper and lower indices in any pair of dummy indices.†
The product is a four-scalar—it is invariant under rotations of the four-dimensional coordinate system. This is easily verified directly,‡ but it is also apparent beforehand (from the analogy with the square ) from the fact that all four-vectors transform according to the same rule.
† In the literature the indices are often omitted on four-vectors, and their squares and scalar products are written as , . We shall not use this notation in the present text.
‡ One should remember that the law for transformation of a four-vector expressed in covariant components differs (in signs) from the same law expressed for contravariant components. Thus, instead of (6.1), one will have:
The component is called the time component, and the space components of the four-vector (in analogy to the radius four-vector). The square of a four-vector can be positive, negative, or zero; such vectors are called, timelike, spacelike, and null-vectors, respectively (again in analogy to the terminology for intervals).†
Under purely spatial rotations (i.e. transformations not affecting the time axis) the three space components of the four-vector form a three-dimensional vector . The time component of the four-vector is a three-dimensional scalar (with respect to these transformations). In enumerating the components of a four-vector, we shall often write them as
The covariant components of the same four-vector are , and the square of the four-vector is . Thus, for the radius four-vector:
For three-dimensional vectors (with coordinates ) there is no need to distinguish between contra- and covariant components. Whenever this can be done without causing confusion, we shall write their components as ( ) using Greek letters for subscripts. In particular we shall assume a summation over for any repeated index (for example, ).
A four-dimensional tensor (four-tensor) of the second rank is a set of sixteen quantities , which under coordinate transformations transform like the products of components of two four-vectors. We similarly define four-tensors of higher rank.
The components of a second-rank tensor can be written in three forms: covariant, , contravariant, , and mixed, (where, in the last case, one should distinguish between and , i.e. one should be careful about which of the two is superscript and which a subscript). The connection between the different types of components is determined from the general rule: raising or lowering a space index (1, 2, 3) changes the sign of the component, while raising or lowering the time index (0) does not. Thus:
Under purely spatial transformations, the nine quantities form a three-tensor. The three components and the three components constitute three-dimensional vectors, while the component is a three-dimensional scalar.
A tensor is said to be symmetric if , and antisymmetric if . In an antisymmetric tensor, all the diagonal components (i.e. the components ) are zero, since, for example, we must have . For a symmetric tensor , the mixed components and obviously coincide; in such cases we shall simply write , putting the indices one above the other.
In every tensor equation, the two sides must contain identical and identically placed (i.e. above or below) free indices (as distinguished from dummy indices). The free indices in tensor equations can be shifted up or down, but this must be done simultaneously in all terms in the equation. Equating covariant and contravariant components of different tensors is “illegal”; such an equation, even if it happened by chance to be valid in a particular reference system, would be violated on going to another frame.
† Null vectors are also said to be isotropic.
From the tensor components one can form a scalar by taking the sum
(where, of course, ). This sum is called the trace of the tensor, and the operation for obtaining it is called contraction.
The formation of the scalar product of two vectors, considered earlier, is a contraction operation: it is the formation of the scalar from the tensor . In general, contracting on any pair of indices reduces the rank of the tensor by 2. For example, is a tensor of second rank is a four-vector, is a scalar, etc.
The unit four-tensor satisfies the condition that for any four-vector ,
It is clear that the components of this tensor are
Its trace is .
By raising the one index or lowering the other in , we can obtain the contravariant tensor or , which is called the metric tensor. The tensors and have identical components, which can be written as a matrix:
(the index labels the rows, and the columns, in the order 0, 1, 2, 3). It is clear that
The scalar product of two four-vectors can therefore be written in the form:
The tensors , , are special in the sense that their components are the same in all coordinate systems. The completely antisymmetric unit tensor of fourth rank, , has the same property. This is the tensor whose components change sign under interchange of any pair of indices, and whose nonzero components are . From the antisymmetry it follows that all components in which two indices are the same are zero, so that the only non-vanishing components are those for which all four indices are different. We set
(hence ). Then all the other nonvanishing components are equal to or , according as the numbers can be brought to the arrangement 0, 1, 2, 3 by an even or an odd number of transpositions. The number of such components is . Thus,
With respect to rotations of the coordinate system, the quantities behave like the components of a tensor; but if we change the sign of one or three of the coordinates the components , being defined as the same in all coordinate systems, do not change, whereas some of the components of a tensor should change sign. Thus is, strictly speaking, not a tensor, but rather a pseudotensor. Pseudotensors of any rank, in particular pseudoscalars, behave like tensors under all coordinate transformations except those that cannot be reduced to rotations, i.e. reflections, which are changes in sign of the coordinates that are not reducible to a rotation.
The products form a four-tensor of rank 8, which is a true tensor; by contracting on one or more pairs of indices, one obtains tensors of rank 6, 4, and 2. All these tensors have the same form in all coordinate systems. Thus their components must be expressed as combinations of products of components of the unit tensor — the only true tensor whose components are the same in all coordinate systems. These combinations can easily be found by starting from the symmetries that they must possess under permutation of indices.†
If is an antisymmetric tensor, the tensor and the pseudotensor are said to be dual to one another. Similarly, is an antisymmetric pseudotensor of rank 3, dual to the vector . The product of dual tensors is obviously a pseudoscalar.
In this connection we note some analogous properties of three-dimensional vectors and tensors. The completely antisymmetric unit pseudotensor of rank 3 is the set of quantities which change sign under any transposition of a pair of indices. The only nonvanishing components of are those with three different indices. We set ; the others are 1 or -1, depending on whether the sequence can be brought to the order by an even or an odd number of transpositions.‡
† For reference we give the following formulas:
The overall coefficient in these formulas can be checked using the result of a complete contraction, which should give (6.9).
As a consequence of these formulas we have:
where is the determinant formed from the quantities .
‡ The fact that the components of the four-tensor are unchanged under rotations of the four-dimensional coordinate system, and that the components of the three-tensor are unchanged by rotations of the space axes are special cases of a general rule: any completely antisymmetric tensor of rank equal to the number of dimensions of the space in which it is defined is invariant under rotations of the coordinate system in the space.
The products form a true three-dimensional tensor of rank 6, and are therefore expressible as combinations of products of components of the unit three-tensor .†
Under a reflection of the coordinate system, i.e. under a change in sign of all the coordinates, the components of an ordinary vector also change sign. Such vectors are said to be polar. The components of a vector that can be written as the cross product of two polar vectors do not change sign under inversion. Such vectors are said to be axial. The scalar product of a polar and an axial vector is not a true scalar, but rather a pseudoscalar; it changes sign under a coordinate inversion. An axial vector is a pseudovector, dual to some antisymmetric tensor. Thus, if , then
Now consider four-tensors. The space components ( ) of the antisymmetric tensor form a three-dimensional antisymmetric tensor with respect to purely spatial transformations; according to our statement its components can be expressed in terms of the components of a three-dimensional axial vector. With respect to these same transformations the components form a three-dimensional polar vector. Thus the components of an antisymmetric four-tensor can be written as a matrix:
where, with respect to spatial transformations, and are polar and axial vectors, respectively. In enumerating the components of an antisymmetric four-tensor, we shall write them in the form
then the covariant components of the same tensor are
Finally we consider certain differential and integral operations of four-dimensional tensor analysis.
The four-gradient of a scalar is the four-vector
† For reference, we give the appropriate formulas:
Contracting this tensor on one, two and three pairs of indices, we get:
We must remember that these derivatives are to be regarded as the covariant components of the four-vector. In fact, the differential of the scalar
is also a scalar; from its form (scalar product of two four-vectors) our assertion is obvious.
In general, the operators of differentiation with respect to the coordinates , , should be regarded as the covariant components of the operator four-vector. Thus, for example, the divergence of a four-vector, the expression , in which we differentiate the contravariant components , is a scalar.†
In three-dimensional space one can extend integrals over a volume, a surface or a curve. In four-dimensional space there are four types of integrations:
(1) Integral over a curve in four-space. The element of integration is the line element, i.e. the four-vector .
(2) Integral over a (two-dimensional) surface in four-space. As we know, in three-space the projections of the area of the parallelogram formed from the vectors and on the coordinate planes are . Analogously, in four-space the infinitesimal element of surface is given by the antisymmetric tensor of second rank ; its components are the projections of the element of area on the coordinate planes. In three-dimensional space, as we know, one uses as surface element in place of the tensor the vector dual to the tensor : . Geometrically this is a vector normal to the surface element and equal in absolute magnitude to the area of the element. In four-space we cannot construct such a vector, but we can construct the tensor dual to the tensor ,
Geometrically it describes an element of surface equal to and “normal” to the element of
† If we differentiate with respect to the “covariant coordinates” , then the derivatives
form the contravariant components of a four-vector. We shall use this form only in exceptional cases [for example, for writing the square of the four-gradient ].
We note that in the literature partial derivatives with respect to the coordinates are often abbreviated using the symbols.
In this form of writing of the differentiation operators, the co- or contravariant character of quantities formed with them is explicit. This same advantage exists for another abbreviated form for writing derivatives, using the index preceded by a comma: