The Role Played by Sets of Measure Zero

1.35 Definition Let PP be a property which a point xx may or may not have. For instance, PP might be the property " f(x)>0f(x) > 0 " if ff is a given function, or it might be " {fn(x)}\{f_n(x)\} converges" if {fn}\{f_n\} is a given sequence of functions.

If μ\mu is a measure on a σ\sigma -algebra M\mathfrak{M} and if EME \in \mathfrak{M} , the statement " PP holds almost everywhere on EE " (abbreviated to " PP holds a.e. on EE “) means that there exists an NMN \in \mathfrak{M} such that μ(N)=0\mu(N) = 0 , NEN \subset E , and PP holds at every point of ENE - N . This concept of a.e. depends of course very strongly on the given measure, and we shall write “a.e. [μ][\mu] " whenever clarity requires that the measure be indicated.

For example, if ff and gg are measurable functions and if

μ({x:f(x)g(x)})=0,(1)\mu(\{x: f(x) \ne g(x)\}) = 0, \quad (1)

we say that f=gf = g a.e. [μ][\mu] on XX , and we may write fgf \sim g . This is easily seen to be an equivalence relation. The transitivity ( fgf \sim g and ghg \sim h implies fhf \sim h ) is a consequence of the fact that the union of two sets of measure 0 has measure 0.

Note that if fgf \sim g , then, for every EME \in \mathfrak{M} ,

Efdμ=Egdμ.(2)\int_E f \, d\mu = \int_E g \, d\mu. \quad (2)

To see this, let NN be the set which appears in (1); then EE is the union of the disjoint sets ENE - N and ENE \cap N ; on ENE - N , f=gf = g , and μ(EN)=0\mu(E \cap N) = 0 .

Thus, generally speaking, sets of measure 0 are negligible in integration. It ought to be true that every subset of a negligible set is negligible. But it may happen that some set NMN \in \mathcal{M} with μ(N)=0\mu(N) = 0 has a subset EE which is not a member of M\mathcal{M} . Of course we can define μ(E)=0\mu(E) = 0 in this case. But will this extension of μ\mu still be a measure, i.e., will it still be defined on a σ\sigma -algebra? It is a pleasant fact that the answer is affirmative:

1.36 Theorem Let (X,M,μ)(X, \mathcal{M}, \mu) be a measure space, let M\mathcal{M}^* be the collection of all EXE \subset X for which there exist sets AA and BMB \in \mathcal{M} such that AEBA \subset E \subset B and μ(BA)=0\mu(B - A) = 0 , and define μ(E)=μ(A)\mu(E) = \mu(A) in this situation. Then M\mathcal{M}^* is a σ\sigma -algebra, and μ\mu is a measure on M\mathcal{M}^* .

This extended measure μ\mu is called complete, since all subsets of sets of measure 0 are now measurable; the σ\sigma -algebra M\mathcal{M}^* is called the μ\mu -completion of M\mathcal{M} . The theorem says that every measure can be completed, so, whenever it is convenient, we may assume that any given measure is complete; this just gives us more measurable sets, hence more measurable functions. Most measures that one meets in the ordinary course of events are already complete, but there are exceptions; one of these will occur in the proof of Fubini’s theorem in Chap. 8.

PROOF We begin by checking that μ\mu is well defined for every EME \in \mathcal{M}^* . Suppose AEBA \subset E \subset B , A1EB1A_1 \subset E \subset B_1 , and μ(BA)=μ(B1A1)=0\mu(B - A) = \mu(B_1 - A_1) = 0 . (The letters AA and BB will denote members of M\mathcal{M} throughout this proof.) Since

AA1EA1B1A1A - A_1 \subset E - A_1 \subset B_1 - A_1

we have μ(AA1)=0\mu(A - A_1) = 0 , hence μ(A)=μ(AA1)\mu(A) = \mu(A \cap A_1) . For the same reason, μ(A1)=μ(A1A)\mu(A_1) = \mu(A_1 \cap A) . We conclude that indeed μ(A1)=μ(A)\mu(A_1) = \mu(A) .

Next, let us verify that M\mathcal{M}^* has the three defining properties of a σ\sigma -algebra.

(i) XMX \in \mathcal{M}^* , because XMX \in \mathcal{M} and MM\mathcal{M} \subset \mathcal{M}^* . (ii) If AEBA \subset E \subset B then BcEcAcB^c \subset E^c \subset A^c . Thus EME \in \mathcal{M}^* implies EcME^c \in \mathcal{M}^* , because AcBc=AcB=BAA^c - B^c = A^c \cap B = B - A . (iii) If AiEiBiA_i \subset E_i \subset B_i , E=EiE = \bigcup E_i , A=AiA = \bigcup A_i , B=BiB = \bigcup B_i , then AEBA \subset E \subset B and

BA=i=1(BiA)i=1(BiAi).B - A = \bigcup_{i=1}^{\infty} (B_i - A) \subset \bigcup_{i=1}^{\infty} (B_i - A_i).

Since countable unions of sets of measure zero have measure zero, it follows that EME \in \mathcal{M}^* if EiME_i \in \mathcal{M}^* for i=1,2,3,i = 1, 2, 3, \dots .

Finally, if the sets EiE_i are disjoint in step (iii), the same is true of the sets AiA_i , and we conclude that

μ(E)=μ(A)=i=1μ(Ai)=i=1μ(Ei).\mu(E) = \mu(A) = \sum_{i=1}^{\infty} \mu(A_i) = \sum_{i=1}^{\infty} \mu(E_i).

This proves that μ\mu is countably additive on M\mathcal{M}^* . ////

1.37 The fact that functions which are equal a.e. are indistinguishable as far as integration is concerned suggests that our definition of measurable function might profitably be enlarged. Let us call a function ff defined on a set EME \in \mathfrak{M} measurable on XX if μ(Ec)=0\mu(E^c) = 0 and if f1(V)Ef^{-1}(V) \cap E is measurable for every open set VV . If we define f(x)=0f(x) = 0 for xEcx \in E^c , we obtain a measurable function on XX , in the old sense. If our measure happens to be complete, we can define ff on EcE^c in a perfectly arbitrary manner, and we still get a measurable function. The integral of ff over any set AMA \in \mathfrak{M} is independent of the definition of ff on EcE^c ; therefore this definition need not even be specified at all.

There are many situations where this occurs naturally. For instance, a function ff on the real line may be differentiable only almost everywhere (with respect to Lebesgue measure), but under certain conditions it is still true that ff is the integral of its derivative; this will be discussed in Chap. 7. Or a sequence {fn}\{f_n\} of measurable functions on XX may converge only almost everywhere; with our new definition of measurability, the limit is still a measurable function on XX , and we do not have to cut down to the set on which convergence actually occurs.

To illustrate, let us state a corollary of Lebesgue’s dominated convergence theorem in a form in which exceptional sets of measure zero are admitted:

1.38 Theorem Suppose {fn}\{f_n\} is a sequence of complex measurable functions defined a.e. on XX such that

n=1Xfndμ<.(1)\sum_{n=1}^{\infty} \int_X |f_n| \, d\mu < \infty. \quad (1)

Then the series

f(x)=n=1fn(x)(2)f(x) = \sum_{n=1}^{\infty} f_n(x) \quad (2)

converges for almost all xx , fL1(μ)f \in L^1(\mu) , and

Xfdμ=n=1Xfndμ.(3)\int_X f \, d\mu = \sum_{n=1}^{\infty} \int_X f_n \, d\mu. \quad (3)

PROOF Let SnS_n be the set on which fnf_n is defined, so that μ(Snc)=0\mu(S_n^c) = 0 . Put φ(x)=fn(x)\varphi(x) = \sum |f_n(x)| , for xS=Snx \in S = \bigcap S_n . Then μ(Sc)=0\mu(S^c) = 0 . By (1) and Theorem 1.27,

Sφdμ<.(4)\int_S \varphi \, d\mu < \infty. \quad (4)

If E={xS:φ(x)<}E = \{x \in S: \varphi(x) < \infty\} , it follows from (4) that μ(Ec)=0\mu(E^c) = 0 . The series (2) converges absolutely for every xEx \in E , and if f(x)f(x) is defined by (2) for xEx \in E , then f(x)φ(x)|f(x)| \le \varphi(x) on EE , so that fL1(μ)f \in L^1(\mu) on EE , by (4). If gn=f1++fng_n = f_1 + \cdots + f_n , then gnφ|g_n| \le \varphi , gn(x)f(x)g_n(x) \to f(x) for all xEx \in E , and Theorem 1.34 gives (3) with EE in place of XX . This is equivalent to (3), since μ(Ec)=0\mu(E^c) = 0 . ////

Note that even if the fnf_n were defined at every point of XX , (1) would only imply that (2) converges almost everywhere. Here are some other situations in which we can draw conclusions only almost everywhere:

1.39 Theorem(a) Suppose f:X[0,]f: X \to [0, \infty] is measurable, EME \in \mathcal{M} , and Efdμ=0\int_E f d\mu = 0 . Then f=0f = 0 a.e. on EE .

(b) Suppose fL1(μ)f \in L^1(\mu) and Efdμ=0\int_E f d\mu = 0 for every EME \in \mathcal{M} . Then f=0f = 0 a.e. on XX .

(c) Suppose fL1(μ)f \in L^1(\mu) and

Xfdμ=Xfdμ.\left| \int_X f d\mu \right| = \int_X |f| d\mu.

Then there is a constant α\alpha such that αf=f\alpha f = |f| a.e. on XX .

Note that (c) describes the condition under which equality holds in Theorem 1.33.

PROOF(a) If An={xE:f(x)>1/n}A_n = \{x \in E: f(x) > 1/n\} , n=1,2,3,n = 1, 2, 3, \dots , then

1nμ(An)AnfdμEfdμ=0,\frac{1}{n} \mu(A_n) \le \int_{A_n} f d\mu \le \int_E f d\mu = 0,

so that μ(An)=0\mu(A_n) = 0 . Since {xE:f(x)>0}=An\{x \in E: f(x) > 0\} = \bigcup A_n , (a) follows.

(b) Put f=u+ivf = u + iv , let E={x:u(x)0}E = \{x: u(x) \ge 0\} . The real part of Efdμ\int_E f d\mu is then Eu+dμ\int_E u^+ d\mu . Hence Eu+dμ=0\int_E u^+ d\mu = 0 , and (a) implies that u+=0u^+ = 0 a.e. We conclude similarly that

u=v+=v=0a.e.u^- = v^+ = v^- = 0 \quad \text{a.e.}

(c) Examine the proof of Theorem 1.33. Our present assumption implies that the last inequality in the proof of Theorem 1.33 must actually be an equality. Hence E(fu)dμ=0\int_E (|f| - u) d\mu = 0 . Since fu0|f| - u \ge 0 , (a) shows that f=u|f| = u a.e. This says that the real part of αf\alpha f is equal to αf|\alpha f| a.e., hence αf=αf=f\alpha f = |\alpha f| = |f| a.e., which is the desired conclusion. ////

1.40 Theorem Suppose μ(X)<\mu(X) < \infty , fL1(μ)f \in L^1(\mu) , SS is a closed set in the complex plane, and the averages

AE(f)=1μ(E)EfdμA_E(f) = \frac{1}{\mu(E)} \int_E f d\mu

lie in SS for every EME \in \mathcal{M} with μ(E)>0\mu(E) > 0 . Then f(x)Sf(x) \in S for almost all xXx \in X .

PROOF Let Δ\Delta be a closed circular disc (with center at α\alpha and radius r>0r > 0 , say) in the complement of SS . Since ScS^c is the union of countably many such discs, it is enough to prove that μ(E)=0\mu(E) = 0 , where E=f1(Δ)E = f^{-1}(\Delta) .

If we had μ(E)>0\mu(E) > 0 , then

AE(f)α=1μ(E)E(fα)dμ1μ(E)Efαdμr,|A_E(f) - \alpha| = \frac{1}{\mu(E)} \left| \int_E (f - \alpha) d\mu \right| \le \frac{1}{\mu(E)} \int_E |f - \alpha| d\mu \le r,

which is impossible, since AE(f)SA_E(f) \in S . Hence μ(E)=0\mu(E) = 0 . ////

1.41 Theorem Let {Ek}\{E_k\} be a sequence of measurable sets in XX , such that

k=1μ(Ek)<.(1)\sum_{k=1}^{\infty} \mu(E_k) < \infty. \quad (1)

Then almost all xXx \in X lie in at most finitely many of the sets EkE_k .

PROOF If AA is the set of all xx which lie in infinitely many EkE_k , we have to prove that μ(A)=0\mu(A) = 0 . Put

g(x)=k=1χEk(x)(xX).(2)g(x) = \sum_{k=1}^{\infty} \chi_{E_k}(x) \quad (x \in X). \quad (2)

For each xx , each term in this series is either 0 or 1. Hence xAx \in A if and only if g(x)=g(x) = \infty . By Theorem 1.27, the integral of gg over XX is equal to the sum in (1). Thus gL1(μ)g \in L^1(\mu) , and so g(x)<g(x) < \infty a.e. ////