Vector Spaces

2.1 Definition A complex vector space (or a vector space over the complex field) is a set VV , whose elements are called vectors and in which two operations, called addition and scalar multiplication, are defined, with the following familiar algebraic properties:

To every pair of vectors xx and yy there corresponds a vector x+yx + y , in such a way that x+y=y+xx + y = y + x and x+(y+z)=(x+y)+zx + (y + z) = (x + y) + z ; VV contains a unique vector 00 (the zero vector or origin of VV ) such that x+0=xx + 0 = x for every xVx \in V ; and to each xVx \in V there corresponds a unique vector x-x such that x+(x)=0x + (-x) = 0 .

To each pair (α,x)(\alpha, x) , where xVx \in V and α\alpha is a scalar (in this context, the word scalar means complex number), there is associated a vector αxV\alpha x \in V , in such a way that 1x=x1x = x , α(βx)=(αβ)x\alpha(\beta x) = (\alpha\beta)x , and such that the two distributive laws

α(x+y)=αx+αy,(α+β)x=αx+βx(1)\alpha(x + y) = \alpha x + \alpha y, \quad (\alpha + \beta)x = \alpha x + \beta x \quad (1)

hold.

A linear transformation of a vector space VV into a vector space V1V_1 is a mapping Λ\Lambda of VV into V1V_1 such that

Λ(αx+βy)=αΛx+βΛy(2)\Lambda(\alpha x + \beta y) = \alpha \Lambda x + \beta \Lambda y \quad (2)

for all xx and yVy \in V and for all scalars α\alpha and β\beta . In the special case in which V1V_1 is the field of scalars (this is the simplest example of a vector space, except for the trivial one consisting of 00 alone), Λ\Lambda is called a linear functional. A linear functional is thus a complex function on VV which satisfies (2).

Note that one often writes Λx\Lambda x , rather than Λ(x)\Lambda(x) , if Λ\Lambda is linear.

The preceding definitions can of course be made equally well with any field whatsoever in place of the complex field. Unless the contrary is explicitly stated, however, all vector spaces occurring in this book will be complex, with one notable exception: the euclidean spaces RkR^k are vector spaces over the real field.

2.2 Integration as a Linear Functional Analysis is full of vector spaces and linear transformations, and there is an especially close relationship between integration on the one hand and linear functionals on the other.

For instance, Theorem 1.32 shows that L1(μ)L^1(\mu) is a vector space, for any positive measure μ\mu , and that the mapping

fXfdμ(1)f \to \int_X f d\mu \quad (1)

is a linear functional on L1(μ)L^1(\mu) . Similarly, if gg is any bounded measurable function, the mapping

fXfgdμ(2)f \to \int_X fg d\mu \quad (2)

is a linear functional on L1(μ)L^1(\mu) ; we shall see in Chap. 6 that the functionals (2) are, in a sense, the only interesting ones on L1(μ)L^1(\mu) .

For another example, let CC be the set of all continuous complex functions on the unit interval I=[0,1]I = [0, 1] . The sum of the two continuous functions is continuous, and so is any scalar multiple of a continuous function. Hence CC is a vector space, and if

Λf=01f(x)dx(fC),(3)\Lambda f = \int_0^1 f(x) dx \quad (f \in C), \quad (3)

the integral being the ordinary Riemann integral, then Λ\Lambda is clearly a linear functional on CC ; Λ\Lambda has an additional interesting property: it is a positive linear functional. This means that Λf0\Lambda f \ge 0 whenever f0f \ge 0 .

One of the tasks which is still ahead of us is the construction of the Lebesgue measure. The construction can be based on the linear functional (3), by the following observation: Consider a segment (a,b)I(a, b) \subset I and consider the class of all fCf \in C such that 0f10 \le f \le 1 on II and f(x)=0f(x) = 0 for all xx not in (a,b)(a, b) . We have Λf<ba\Lambda f < b - a for all such ff , but we can choose ff so that Λf\Lambda f is as close to bab - a as desired. Thus the length (or measure) of (a,b)(a, b) is intimately related to the values of the functional Λ\Lambda .

The preceding observation, when looked at from a more general point of view, leads to a remarkable and extremely important theorem of F. Riesz:

To every positive linear functional Λ\Lambda on CC corresponds a finite positive Borel measure μ\mu on II such that

Λf=Ifdμ(fC).(4)\Lambda f = \int_I f d\mu \quad (f \in C). \quad (4)

[The converse is obvious: if μ\mu is a finite positive Borel measure on II and if Λ\Lambda is defined by (4), then Λ\Lambda is a positive linear functional on CC .]

It is clearly of interest to replace the bounded interval II by R1R^1 . We can do this by restricting attention to those continuous functions on R1R^1 which vanish outside some bounded interval. (These functions are Riemann integrable, for instance.) Next, functions of several variables occur frequently in analysis. Thus we ought to move from R1R^1 to RnR^n . It turns out that the proof of the Riesz theorem still goes through, with hardly any changes. Moreover, it turns out that the euclidean properties of RnR^n (coordinates, orthogonality, etc.) play no role in the proof; in fact, if one thinks of them too much they just get in the way. Essential to the proof are certain topological properties of RnR^n . (Naturally. We are now dealing with continuous functions.) The crucial property is that of local compactness: Each point of RnR^n has a neighborhood whose closure is compact.

We shall therefore establish the Riesz theorem in a very general setting (Theorem 2.14). The existence of Lebesgue measure follows then as a special case. Those who wish to concentrate on a more concrete situation may skip lightly over the following section on topological preliminaries (Urysohn’s lemma is the item of greatest interest there; see Exercise 3) and may replace locally compact Hausdorff spaces by locally compact metric spaces, or even by euclidean spaces, without missing any of the principal ideas.

It should also be mentioned that there are situations, especially in probability theory, where measures occur naturally on spaces without topology, or on topological spaces that are not locally compact. An example is the so-called Wiener measure which assigns numbers to certain sets of continuous functions and which is a basic tool in the study of Brownian motion. These topics will not be discussed in this book.