§ 6. Four-vectors

The coordinates of an event (ct,x,y,z)(ct, x, y, z) can be considered as the components of a four-dimensional radius vector (or, for short, a four-radius vector) in a four-dimensional space. We shall denote its components by xix^i , where the index ii takes on the values 0, 1, 2, 3, and

x0=ct,x1=x,x2=y,x3=z.x^0 = ct, \quad x^1 = x, \quad x^2 = y, \quad x^3 = z.

The square of the “length” of the radius four-vector is given by

(x0)2(x1)2(x2)2(x3)2.(x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2.

It does not change under any rotations of the four-dimensional coordinate system, in particular under Lorentz transformations.

In general a set of four quantities A0,A1,A2,A3A^0, A^1, A^2, A^3 which transform like the components of the radius four-vector xix^i under transformations of the four-dimensional coordinate system is called a four-dimensional vector (four-vector) AiA^i . Under Lorentz transformations,

A0=A0+VcA11V2c2,A1=A1+VcA01V2c2,A2=A2,A3=A3.(6.1)A^0 = \frac{A'^0 + \frac{V}{c}A'^1}{\sqrt{1 - \frac{V^2}{c^2}}}, \quad A^1 = \frac{A'^1 + \frac{V}{c}A'^0}{\sqrt{1 - \frac{V^2}{c^2}}}, \quad A^2 = A'^2, \quad A^3 = A'^3. \quad \tag{6.1}

The square magnitude of any four-vector is defined analogously to the square of the radius four-vector:

(A0)2(A1)2(A2)2(A3)2.(A^0)^2 - (A^1)^2 - (A^2)^2 - (A^3)^2.

For convenience of notation, we introduce two “types” of components of four-vectors, denoting them by the symbols AiA^i and AiA_i , with superscripts and subscripts. These are related by

A0=A0,A1=A1,A2=A2,A3=A3.(6.2)A_0 = A^0, \quad A_1 = -A^1, \quad A_2 = -A^2, \quad A_3 = -A^3. \quad \tag{6.2}

The square of the four-vector then appears in the form

i=03AiAi=A0A0+A1A1+A2A2+A3A3.\sum_{i=0}^{3} A^i A_i = A^0 A_0 + A^1 A_1 + A^2 A_2 + A^3 A_3.

Such sums are customarily written simply as AiAiA^i A_i , omitting the summation sign. One agrees that one sums over any repeated index, and omits the summation sign. Of the pair of indices, one must be a superscript and the other a subscript. This convention for summation over “dummy” indices is very convenient and considerably simplifies the writing of formulas.

We shall use Latin letters i,k,l,i, k, l, \dots , for four-dimensional indices, taking on the values 0, 1, 2, 3.

In analogy to the square of a four-vector, one forms the scalar product of two different four-vectors:

AiBi=A0B0+A1B1+A2B2+A3B3.A^i B_i = A^0 B_0 + A^1 B_1 + A^2 B_2 + A^3 B_3.

It is clear that this can be written either as AiBiA^i B_i or AiBiA_i B^i —the result is the same. In general one can switch upper and lower indices in any pair of dummy indices.†

The product AiBiA^i B_i is a four-scalar—it is invariant under rotations of the four-dimensional coordinate system. This is easily verified directly,‡ but it is also apparent beforehand (from the analogy with the square AiAiA^i A_i ) from the fact that all four-vectors transform according to the same rule.

† In the literature the indices are often omitted on four-vectors, and their squares and scalar products are written as A2A^2 , ABAB . We shall not use this notation in the present text.

‡ One should remember that the law for transformation of a four-vector expressed in covariant components differs (in signs) from the same law expressed for contravariant components. Thus, instead of (6.1), one will have:

A0=A0VcA11V2c2,A1=A1VcA01V2c2,A2=A2,A3=A3.A_0 = \frac{A'_0 - \frac{V}{c}A'_1}{\sqrt{1 - \frac{V^2}{c^2}}}, \quad A_1 = \frac{A'_1 - \frac{V}{c}A'_0}{\sqrt{1 - \frac{V^2}{c^2}}}, \quad A_2 = A'_2, \quad A_3 = A'_3.

The component A0A^0 is called the time component, and A1,A2,A3A^1, A^2, A^3 the space components of the four-vector (in analogy to the radius four-vector). The square of a four-vector can be positive, negative, or zero; such vectors are called, timelike, spacelike, and null-vectors, respectively (again in analogy to the terminology for intervals).†

Under purely spatial rotations (i.e. transformations not affecting the time axis) the three space components of the four-vector AiA^i form a three-dimensional vector AA . The time component of the four-vector is a three-dimensional scalar (with respect to these transformations). In enumerating the components of a four-vector, we shall often write them as

Ai=(A0,A).A^i = (A^0, A).

The covariant components of the same four-vector are Ai=(A0,A)A_i = (A^0, -A) , and the square of the four-vector is AiAi=(A0)2A2A^i A_i = (A^0)^2 - A^2 . Thus, for the radius four-vector:

xi=(ct,r),xi=(ct,r),xixi=c2t2r2.x^i = (ct, r), \qquad x_i = (ct, -r), \qquad x^i x_i = c^2 t^2 - r^2.

For three-dimensional vectors (with coordinates x,y,zx, y, z ) there is no need to distinguish between contra- and covariant components. Whenever this can be done without causing confusion, we shall write their components as AαA_\alpha ( α=x,y,z\alpha = x, y, z ) using Greek letters for subscripts. In particular we shall assume a summation over x,y,zx, y, z for any repeated index (for example, AB=AαBαA \cdot B = A_\alpha B_\alpha ).

A four-dimensional tensor (four-tensor) of the second rank is a set of sixteen quantities AikA^{ik} , which under coordinate transformations transform like the products of components of two four-vectors. We similarly define four-tensors of higher rank.

The components of a second-rank tensor can be written in three forms: covariant, AikA_{ik} , contravariant, AikA^{ik} , and mixed, AkiA^i_k (where, in the last case, one should distinguish between AkiA^i_k and AikA^k_i , i.e. one should be careful about which of the two is superscript and which a subscript). The connection between the different types of components is determined from the general rule: raising or lowering a space index (1, 2, 3) changes the sign of the component, while raising or lowering the time index (0) does not. Thus:

A00=A00,A01=A01,A11=A11,,A00=A00,A01=A01,A10=A01,A11=A11,A_{00} = A^{00}, \qquad A_{01} = -A^{01}, \qquad A_{11} = A^{11}, \qquad \dots, \qquad A_0^0 = A^{00}, \qquad A_0^1 = A^{01}, \qquad A_1^0 = -A^{01}, \qquad A_1^1 = -A^{11}, \qquad \dots

Under purely spatial transformations, the nine quantities A11,A12,A^{11}, A^{12}, \dots form a three-tensor. The three components A01,A02,A03A^{01}, A^{02}, A^{03} and the three components A10,A20,A30A^{10}, A^{20}, A^{30} constitute three-dimensional vectors, while the component A00A^{00} is a three-dimensional scalar.

A tensor AikA^{ik} is said to be symmetric if Aik=AkiA^{ik} = A^{ki} , and antisymmetric if Aik=AkiA^{ik} = -A^{ki} . In an antisymmetric tensor, all the diagonal components (i.e. the components A00,A11,A^{00}, A^{11}, \dots ) are zero, since, for example, we must have A00=A00A^{00} = -A^{00} . For a symmetric tensor AikA^{ik} , the mixed components AkiA^i_k and AikA^k_i obviously coincide; in such cases we shall simply write AkiA^i_k , putting the indices one above the other.

In every tensor equation, the two sides must contain identical and identically placed (i.e. above or below) free indices (as distinguished from dummy indices). The free indices in tensor equations can be shifted up or down, but this must be done simultaneously in all terms in the equation. Equating covariant and contravariant components of different tensors is “illegal”; such an equation, even if it happened by chance to be valid in a particular reference system, would be violated on going to another frame.

† Null vectors are also said to be isotropic.

From the tensor components AikA^{ik} one can form a scalar by taking the sum

Aii=A00+A11+A22+A33A^i_i = A^0_0 + A^1_1 + A^2_2 + A^3_3

(where, of course, Aii=AiiA^i_i = A^i_i ). This sum is called the trace of the tensor, and the operation for obtaining it is called contraction.

The formation of the scalar product of two vectors, considered earlier, is a contraction operation: it is the formation of the scalar AiBiA^i B_i from the tensor AiBkA^i B_k . In general, contracting on any pair of indices reduces the rank of the tensor by 2. For example, AkliA^i_{kl} is a tensor of second rank AkiBkA^i_k B^k is a four-vector, AikA^{ik} is a scalar, etc.

The unit four-tensor δki\delta^i_k satisfies the condition that for any four-vector AiA^i ,

δikAi=Ak.(6.3)\delta^k_i A^i = A^k. \quad \tag{6.3}

It is clear that the components of this tensor are

δik={1,if i=k0,if ik(6.4)\delta^k_i = \begin{cases} 1, & \text{if } i = k \\ 0, & \text{if } i \neq k \end{cases} \quad \tag{6.4}

Its trace is δii=4\delta^i_i = 4 .

By raising the one index or lowering the other in δik\delta^k_i , we can obtain the contravariant tensor gikg^{ik} or gikg_{ik} , which is called the metric tensor. The tensors gikg^{ik} and gikg_{ik} have identical components, which can be written as a matrix:

(gik)=(gik)=(1000010000100001)(6.5)(g^{ik}) = (g_{ik}) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix} \quad \tag{6.5}

(the index ii labels the rows, and kk the columns, in the order 0, 1, 2, 3). It is clear that

gikAk=Ai,gikAk=Ai.(6.6)g_{ik} A^k = A_i, \quad g^{ik} A_k = A^i. \quad \tag{6.6}

The scalar product of two four-vectors can therefore be written in the form:

AiAi=gikAiAk=gikAiAk.(6.7)A^i A_i = g_{ik} A^i A^k = g^{ik} A_i A_k. \quad \tag{6.7}

The tensors δki\delta^i_k , gikg_{ik} , gikg^{ik} are special in the sense that their components are the same in all coordinate systems. The completely antisymmetric unit tensor of fourth rank, eiklme^{iklm} , has the same property. This is the tensor whose components change sign under interchange of any pair of indices, and whose nonzero components are ±1\pm 1 . From the antisymmetry it follows that all components in which two indices are the same are zero, so that the only non-vanishing components are those for which all four indices are different. We set

e0123=+1(6.8)e^{0123} = +1 \quad \tag{6.8}

(hence e0123=1e_{0123} = -1 ). Then all the other nonvanishing components eiklme^{iklm} are equal to +1+1 or 1-1 , according as the numbers i,k,l,mi, k, l, m can be brought to the arrangement 0, 1, 2, 3 by an even or an odd number of transpositions. The number of such components is 4!=244! = 24 . Thus,

eiklmeiklm=24.(6.9)e^{iklm} e_{iklm} = -24. \quad \tag{6.9}

With respect to rotations of the coordinate system, the quantities eiklme^{iklm} behave like the components of a tensor; but if we change the sign of one or three of the coordinates the components eiklme^{iklm} , being defined as the same in all coordinate systems, do not change, whereas some of the components of a tensor should change sign. Thus eiklme^{iklm} is, strictly speaking, not a tensor, but rather a pseudotensor. Pseudotensors of any rank, in particular pseudoscalars, behave like tensors under all coordinate transformations except those that cannot be reduced to rotations, i.e. reflections, which are changes in sign of the coordinates that are not reducible to a rotation.

The products eiklmeprste^{iklm} e^{prst} form a four-tensor of rank 8, which is a true tensor; by contracting on one or more pairs of indices, one obtains tensors of rank 6, 4, and 2. All these tensors have the same form in all coordinate systems. Thus their components must be expressed as combinations of products of components of the unit tensor δki\delta_k^i — the only true tensor whose components are the same in all coordinate systems. These combinations can easily be found by starting from the symmetries that they must possess under permutation of indices.†

If AikA^{ik} is an antisymmetric tensor, the tensor AikA^{ik} and the pseudotensor Aik=12eiklmAlmA^{*ik} = \frac{1}{2} e^{iklm} A_{lm} are said to be dual to one another. Similarly, eiklmAme^{iklm} A_m is an antisymmetric pseudotensor of rank 3, dual to the vector AiA^i . The product AikAikA^{ik} A_{ik}^* of dual tensors is obviously a pseudoscalar.

In this connection we note some analogous properties of three-dimensional vectors and tensors. The completely antisymmetric unit pseudotensor of rank 3 is the set of quantities eαβγe_{\alpha\beta\gamma} which change sign under any transposition of a pair of indices. The only nonvanishing components of eαβγe_{\alpha\beta\gamma} are those with three different indices. We set exyz=1e_{xyz} = 1 ; the others are 1 or -1, depending on whether the sequence α,β,γ\alpha, \beta, \gamma can be brought to the order x,y,zx, y, z by an even or an odd number of transpositions.‡

† For reference we give the following formulas:

eiklmeprst=δpiδriδsiδtiδpkδrkδskδtkδplδrlδslδtlδpmδrmδsmδtm,eiklmeprsm=δpiδriδsiδtiδpkδrkδskδtkδplδrlδslδtlδpmδrmδsmδtme^{iklm} e_{prst} = - \begin{vmatrix} \delta_p^i & \delta_r^i & \delta_s^i & \delta_t^i \\ \delta_p^k & \delta_r^k & \delta_s^k & \delta_t^k \\ \delta_p^l & \delta_r^l & \delta_s^l & \delta_t^l \\ \delta_p^m & \delta_r^m & \delta_s^m & \delta_t^m \end{vmatrix}, \quad e^{iklm} e_{prsm} = - \begin{vmatrix} \delta_p^i & \delta_r^i & \delta_s^i & \delta_t^i \\ \delta_p^k & \delta_r^k & \delta_s^k & \delta_t^k \\ \delta_p^l & \delta_r^l & \delta_s^l & \delta_t^l \\ \delta_p^m & \delta_r^m & \delta_s^m & \delta_t^m \end{vmatrix}eiklmeprlm=2(δpiδrkδriδpk),e^{iklm} e_{prlm} = -2(\delta_p^i \delta_r^k - \delta_r^i \delta_p^k),eiklmeprlm=6δpi.e^{iklm} e_{prlm} = -6\delta_p^i.

The overall coefficient in these formulas can be checked using the result of a complete contraction, which should give (6.9).

As a consequence of these formulas we have:

eprstAipAkrAlsAmt=Aeiklm.e^{prst} A_{ip} A_{kr} A_{ls} A_{mt} = -A e_{iklm}.eiklmeprstAipAkrAlsAmt=24A.e^{iklm} e^{prst} A_{ip} A_{kr} A_{ls} A_{mt} = 24A.

where AA is the determinant formed from the quantities AikA_{ik} .

‡ The fact that the components of the four-tensor eiklme^{iklm} are unchanged under rotations of the four-dimensional coordinate system, and that the components of the three-tensor eαβγe_{\alpha\beta\gamma} are unchanged by rotations of the space axes are special cases of a general rule: any completely antisymmetric tensor of rank equal to the number of dimensions of the space in which it is defined is invariant under rotations of the coordinate system in the space.

The products eαβγeλμνe_{\alpha\beta\gamma}e_{\lambda\mu\nu} form a true three-dimensional tensor of rank 6, and are therefore expressible as combinations of products of components of the unit three-tensor δαβ\delta_{\alpha\beta} .†

Under a reflection of the coordinate system, i.e. under a change in sign of all the coordinates, the components of an ordinary vector also change sign. Such vectors are said to be polar. The components of a vector that can be written as the cross product of two polar vectors do not change sign under inversion. Such vectors are said to be axial. The scalar product of a polar and an axial vector is not a true scalar, but rather a pseudoscalar; it changes sign under a coordinate inversion. An axial vector is a pseudovector, dual to some antisymmetric tensor. Thus, if C=A×BC = A \times B , then

Cα=12eαβγCβγ,whereCβγ=AβBγAγBβ.C_{\alpha} = \frac{1}{2} e_{\alpha\beta\gamma} C_{\beta\gamma}, \quad \text{where} \quad C_{\beta\gamma} = A_{\beta} B_{\gamma} - A_{\gamma} B_{\beta}.

Now consider four-tensors. The space components ( i,k,=1,2,3i, k, = 1, 2, 3 ) of the antisymmetric tensor AikA^{ik} form a three-dimensional antisymmetric tensor with respect to purely spatial transformations; according to our statement its components can be expressed in terms of the components of a three-dimensional axial vector. With respect to these same transformations the components A01,A02,A03A^{01}, A^{02}, A^{03} form a three-dimensional polar vector. Thus the components of an antisymmetric four-tensor can be written as a matrix:

(Aik)=0pxpypzpx0azaypyaz0axpzayax0,(6.10)(A^{ik}) = \begin{vmatrix} 0 & p_x & p_y & p_z \\ -p_x & 0 & -a_z & a_y \\ -p_y & a_z & 0 & -a_x \\ -p_z & -a_y & a_x & 0 \end{vmatrix}, \quad \tag{6.10}

where, with respect to spatial transformations, pp and aa are polar and axial vectors, respectively. In enumerating the components of an antisymmetric four-tensor, we shall write them in the form

Aik=(p,a);A^{ik} = (p, a);

then the covariant components of the same tensor are

Aik=(p,a).A_{ik} = (-p, a).

Finally we consider certain differential and integral operations of four-dimensional tensor analysis.

The four-gradient of a scalar ϕ\phi is the four-vector

† For reference, we give the appropriate formulas:

eαβγeλμν=δαλδαμδανδβλδβμδβνδγλδγμδγν.e_{\alpha\beta\gamma}e_{\lambda\mu\nu} = \begin{vmatrix} \delta_{\alpha\lambda} & \delta_{\alpha\mu} & \delta_{\alpha\nu} \\ \delta_{\beta\lambda} & \delta_{\beta\mu} & \delta_{\beta\nu} \\ \delta_{\gamma\lambda} & \delta_{\gamma\mu} & \delta_{\gamma\nu} \end{vmatrix}.

Contracting this tensor on one, two and three pairs of indices, we get:

eαβγeλμγ=δαλδβμδαμδβλ,e_{\alpha\beta\gamma}e_{\lambda\mu\gamma} = \delta_{\alpha\lambda}\delta_{\beta\mu} - \delta_{\alpha\mu}\delta_{\beta\lambda},eαβγeλβγ=2δαλ,e_{\alpha\beta\gamma}e_{\lambda\beta\gamma} = 2\delta_{\alpha\lambda},eαβγeαβγ=6.e_{\alpha\beta\gamma}e_{\alpha\beta\gamma} = 6.ϕxi=(1cϕt,ϕ).\frac{\partial \phi}{\partial x^i} = \left( \frac{1}{c} \frac{\partial \phi}{\partial t}, \nabla \phi \right).

We must remember that these derivatives are to be regarded as the covariant components of the four-vector. In fact, the differential of the scalar

dϕ=ϕxidxid\phi = \frac{\partial \phi}{\partial x^i} dx^i

is also a scalar; from its form (scalar product of two four-vectors) our assertion is obvious.

In general, the operators of differentiation with respect to the coordinates xix^i , /xi\partial/\partial x^i , should be regarded as the covariant components of the operator four-vector. Thus, for example, the divergence of a four-vector, the expression Ai/xi\partial A^i/\partial x^i , in which we differentiate the contravariant components AiA^i , is a scalar.†

In three-dimensional space one can extend integrals over a volume, a surface or a curve. In four-dimensional space there are four types of integrations:

(1) Integral over a curve in four-space. The element of integration is the line element, i.e. the four-vector dxidx^i .

(2) Integral over a (two-dimensional) surface in four-space. As we know, in three-space the projections of the area of the parallelogram formed from the vectors drd\mathbf{r} and drd\mathbf{r}' on the coordinate planes xαxβx_\alpha x_\beta are dxαdxβdxβdxαdx_\alpha dx'_\beta - dx_\beta dx'_\alpha . Analogously, in four-space the infinitesimal element of surface is given by the antisymmetric tensor of second rank dfik=dxidxkdxkdxidf^{ik} = dx^i dx'^k - dx^k dx'^i ; its components are the projections of the element of area on the coordinate planes. In three-dimensional space, as we know, one uses as surface element in place of the tensor dfαβdf_{\alpha\beta} the vector dfαdf_\alpha dual to the tensor dfαβdf_{\alpha\beta} : dfα=12eαβγdfβγdf_\alpha = \frac{1}{2} e_{\alpha\beta\gamma} df_{\beta\gamma} . Geometrically this is a vector normal to the surface element and equal in absolute magnitude to the area of the element. In four-space we cannot construct such a vector, but we can construct the tensor dfikdf^{*ik} dual to the tensor dfikdf^{ik} ,

dfik=12eiklmdflm.(6.11)df^{*ik} = \frac{1}{2} e^{iklm} df_{lm}. \quad \tag{6.11}

Geometrically it describes an element of surface equal to and “normal” to the element of

† If we differentiate with respect to the “covariant coordinates” xix_i , then the derivatives

ϕxi=(1cϕt,ϕ)\frac{\partial \phi}{\partial x_i} = \left( \frac{1}{c} \frac{\partial \phi}{\partial t}, -\nabla \phi \right)

form the contravariant components of a four-vector. We shall use this form only in exceptional cases [for example, for writing the square of the four-gradient (ϕ/xi)(ϕ/xi)(\partial\phi/\partial x^i)(\partial\phi/\partial x_i) ].

We note that in the literature partial derivatives with respect to the coordinates are often abbreviated using the symbols.

i=xi,i=xi.\partial^i = \frac{\partial}{\partial x_i}, \quad \partial_i = \frac{\partial}{\partial x^i}.

In this form of writing of the differentiation operators, the co- or contravariant character of quantities formed with them is explicit. This same advantage exists for another abbreviated form for writing derivatives, using the index preceded by a comma:

ϕi=ϕxi,ϕi=ϕxi.\phi_i = \frac{\partial \phi}{\partial x^i}, \quad \phi^i = \frac{\partial \phi}{\partial x_i}.