next up previous
Next: About this document

Spring 1997 John Rust
Economics 551b 37 Hillhouse, Rm. 27

Proof of the Uniform Law of Large Numbers

This note presents a self-contained proof of the uniform strong law of large numbers (ULLN). The ULLN is useful in situations where we have sample moments of functions tex2html_wrap_inline113 that depends on two arguments: a random element x and a deterministic parameter tex2html_wrap_inline117 . Suppose we observe N IID observations tex2html_wrap_inline121 from some probability distribution F(x) and we fix tex2html_wrap_inline117 at some arbitrary value in the parameter space tex2html_wrap_inline127 , then the ordinary strong law of large numbers (SLLN) states the following:

Strong Law of Large Numbers If tex2html_wrap_inline129 , then with probability 1 we have:

displaymath131

The ULLN is an extension of the SLLN that provides conditions under which tex2html_wrap_inline133 converges to tex2html_wrap_inline135 uniformly in tex2html_wrap_inline117 , i.e. conditions under which we have:

displaymath139

Equation (2) states that the maximum deviation between the random function tex2html_wrap_inline141 and the deterministic function H converges to 0: i.e. the sequence tex2html_wrap_inline147 converges uniformly in tex2html_wrap_inline117 (i.e. in sup norm) to the deterministic function H. To prove (2) it is convenient to work with the normalized functions tex2html_wrap_inline155 defined by

displaymath157

Clearly tex2html_wrap_inline159 for all tex2html_wrap_inline161 .

Uniform Strong Law of Large Numbers: Let tex2html_wrap_inline163 be IID random elements tex2html_wrap_inline165 , where X is a Borel space. Let tex2html_wrap_inline155 be a measurable function of x for all tex2html_wrap_inline117 , and a continuous function of tex2html_wrap_inline117 for almost all tex2html_wrap_inline177 . Suppose that tex2html_wrap_inline127 is compact, and that tex2html_wrap_inline181 converges to 0 with probability 1 for each tex2html_wrap_inline161 . Then if tex2html_wrap_inline187 for some function d satisfying tex2html_wrap_inline191 , then we have tex2html_wrap_inline193 with probability 1, i.e.

displaymath195

Proof: Define a function tex2html_wrap_inline197 by

displaymath199

Since g is continuous in tex2html_wrap_inline117 for almost all x it follows that for almost all x we have

displaymath209

Also, since tex2html_wrap_inline155 is dominated by d(x) uniformly in tex2html_wrap_inline117 , tex2html_wrap_inline197 is also dominated by d(x) and we can apply the Lebesgue dominated convergence theorem to show that

Similarly we can define a function tex2html_wrap_inline223 by substituting tex2html_wrap_inline225 for tex2html_wrap_inline227 in (5), and the result (7) will also hold for tex2html_wrap_inline223 . Now consider the following inequality, which holds for all N and all sequences tex2html_wrap_inline233 and all tex2html_wrap_inline235 in an tex2html_wrap_inline237 -ball tex2html_wrap_inline239 about an arbitrary point tex2html_wrap_inline161 :

displaymath243

Equation (8) implies the following result

displaymath245

Taking limits on both sides of (9) we have with probability 1:

displaymath247

Since tex2html_wrap_inline223 and tex2html_wrap_inline251 tend to 0 as tex2html_wrap_inline255 by (7), given a small tex2html_wrap_inline257 choose tex2html_wrap_inline237 sufficiently small that both of the terms in the max expression on the right hand side of (10) are less than tex2html_wrap_inline261 . Now, the collection of balls tex2html_wrap_inline263 form an open cover of tex2html_wrap_inline127 . By compactness, there is a finite subcover, tex2html_wrap_inline267 . Since inequality (10) holds in each of these balls, we must have

displaymath269

Taking intersections in (11) over a sequence tex2html_wrap_inline271 tending to 0, it follows that

displaymath273

Comment: The ULLN is actually a special case of the SLLN in Banach Spaces. That is, the functions tex2html_wrap_inline275 can be regarded as random elements in the Banach space B of all continuous, bounded functions from tex2html_wrap_inline127 into R. Furthermore, each of these random elements has mean zero, i.e. the expectation of the random function tex2html_wrap_inline275 is the zero function tex2html_wrap_inline285 , also a member of B. The SLLN in Banach spaces states the following:

SLLN for Banach Spaces Let tex2html_wrap_inline289 be IID random elements in a separable Banach space satisfying tex2html_wrap_inline291 (where tex2html_wrap_inline285 is the 0 element of B), and tex2html_wrap_inline299 . Then we have:

displaymath301

which is equivalent to

displaymath303

where tex2html_wrap_inline305 is the norm of the element tex2html_wrap_inline307 .

The ULLN emerges as a special case of the SLLN in Banach spaces by defining the Banach space B to be the space of all continuous functions from tex2html_wrap_inline127 to R, and the norm on B is defined as the supremum norm, i.e. the maximum the absolute value of the function as tex2html_wrap_inline117 ranges over tex2html_wrap_inline127 . Then we can define random elements on B by tex2html_wrap_inline323 and therefore the norm of these tex2html_wrap_inline325 is given by

displaymath327

It is easy to see that the random elements tex2html_wrap_inline325 satisfy the conditions of the Banach Space SLLN and therefore the sample average of these random elements converges with probability 1 to the zero element of B, tex2html_wrap_inline285 , i.e. the 0 function. But the convergence in norm of the sample average of the tex2html_wrap_inline337 to the tex2html_wrap_inline285 element of B is equivalent to the uniform convergence of the sample average of the random functions tex2html_wrap_inline343 to the zero function, which is precisely what the ULLN states. While this more abstract approach to proving the ULLN is conceptually simpler than the direct proof given above, the mathematics involved in proving the SLLN in Banach spaces are too advanced to be covered in this handout or in Econ 551.

Comment: In general, pointwise convergence of functions does not imply uniform convergence. A classic counterexample is the (deterministic) sequence of functions tex2html_wrap_inline345 defined over the space tex2html_wrap_inline347 by

It is clear the the sequence of functions tex2html_wrap_inline351 defined in (13) converge pointwise to the 0 function, tex2html_wrap_inline285 , but the tex2html_wrap_inline351 sequence can't converge uniformly to tex2html_wrap_inline285 since tex2html_wrap_inline361 for all N. What is going wrong here is that while each tex2html_wrap_inline345 is continuous, the functions are converging to a discontinous function equal to 1 at tex2html_wrap_inline369 and 0 for tex2html_wrap_inline373 . Another way of saying this is that the sequence tex2html_wrap_inline351 is not uniformly equicontinuous.

Definition: A collection of functions tex2html_wrap_inline377 mapping tex2html_wrap_inline127 into R is uniformly equicontinuous if for each tex2html_wrap_inline383 there exists a tex2html_wrap_inline385 such that for all tex2html_wrap_inline387 and for each tex2html_wrap_inline117 and tex2html_wrap_inline235 satisfying tex2html_wrap_inline393 we have:

displaymath395

The key idea of equicontinuity is that inequality (14) holds simultaneously for all tex2html_wrap_inline387 . There is a classical theorem of functional analysis, Ascoli's Theorem that relates uniform equicontinuity to uniform convergence:

Ascoli's Theorem: Let tex2html_wrap_inline147 be a sequence of deterministic functions from tex2html_wrap_inline127 to R, where tex2html_wrap_inline127 is a compact subset of a Euclidean space (more generally tex2html_wrap_inline127 could be a compact subset of a metric space, and in particular is allowed to be a potentially infinite-dimensional space). Then tex2html_wrap_inline147 converges uniformly to a function tex2html_wrap_inline411 if and only iff a) tex2html_wrap_inline147 converges pointwise to H, and b) tex2html_wrap_inline147 is uniformly equicontinuous. Furthermore, H is necessarily a continuous function.

Any standard textbook on functional analysis will contain a proof of Ascoli's Theorem. the proof is not difficult. We now consider a generalization of Ascoli's Theorem in the case where tex2html_wrap_inline147 is a random sequence of functions. We now need to define what we mean by strong stochastic equicontinuity:

Definition: Let tex2html_wrap_inline147 be a random sequence of functions from tex2html_wrap_inline127 to R. We say that tex2html_wrap_inline147 is weakly (uniformly) stochastically equicontinuous if

displaymath431

Definition: Let tex2html_wrap_inline147 be a random sequence of functions from tex2html_wrap_inline127 to R. We say that tex2html_wrap_inline147 is strongly (uniformly) stochastically equicontinuous (SSE) if tex2html_wrap_inline441 with probability 1 for all tex2html_wrap_inline161 and if the sequence of random functions tex2html_wrap_inline445 is weakly stochastically equicontinuous, where tex2html_wrap_inline447 is defined by tex2html_wrap_inline449 .

The following theorem can be viewed as the stochastic version of Ascoli's Theorem: it provides necessary and sufficient conditions for the strong uniform convergence of a sequence of random functions:

Theorem: Let tex2html_wrap_inline147 be a sequence of strongly uniformly stochastically equicontinuous functions from tex2html_wrap_inline127 to R, where tex2html_wrap_inline127 is a compact subset of a Euclidean space (more generally tex2html_wrap_inline127 could satisfy the weaker restriction of being totally bounded). Then tex2html_wrap_inline147 converges uniformly to a function tex2html_wrap_inline411 with probability 1 if and only iff a) tex2html_wrap_inline147 converges pointwise to H with probability 1, and b) tex2html_wrap_inline147 is strongly uniformly stochastically equicontinuous.

For a proof of this Theorem, see D. Andrews (1992) ``Generic Uniform Convergence'' Economic Theory 8, 241-247.




next up previous
Next: About this document

John Rust
Tue Feb 25 16:56:54 CST 1997