Originally published at Inside the Mind of the G Machine. You can comment here or there.
I always like to think of understanding of the delta epsilon definition of limit as somewhat of an ideal dividing line on the cognitive hierarchy, between actually smart and pseudo smart. I still remember vividly struggling to grok that back in high school when I first saw it junior year, though summer after, it made sense, as for why it was reasonable to define it that way. That such was only established in the 19th century goes to show how unnatural such abstract precise definitions are for the human brain (more reason to use cognitive genomics to enhance it 😉 ). At that time, I would not have imagined easily that this limit definition could be generalized further, discarding the deltas and epsilons, which presumes and restricts to real numbers, as it already felt abstract enough. Don’t even get me started on topological spaces, nets, filters, and ultrafilters; my understanding of them is still cursory at best, but someday I will fully internalize them.
Fortunately, I have improved significantly since then, both in terms of experience and in terms of my biological intelligence, that last night, I was able to reconstruct in my head the proof of the Arzela-Ascoli theorem, which also had been once upon a time mind-boggling. Remarkably, that proof, viewed appropriately, comes naturally out of just a few simple, critical observations.
The statement of the theorem is as follows.
Arzela-Ascoli theorem Let be a family of functions from
to
that are uniformly bounded and equicontinuous. Then there is a sequence of
of elements in
that converges uniformly in
.
The rationals are dense in the reals and make an excellent starting point. Uniform boundedness enables us employ Bolzano-Weierstrass to construct a sequence of functions convergent at any rational in . With a countable number of such rationals, we can iteratively filter this sequence to construct one that converges at every rational in
. Essentially, we have an enumeration of the rationals in
and a chain of subsequences based on that, where in the
th subsequence is convergent at the first
rationals in the enumeration, and Bolzano-Weierstrass is applied onto the results of the application of the functions of that subsequence on the
th rational to yield another subsequence. Take, diagonally, the
th function in the
th subsequence, to derive our desired uniformly convergent sequence, which we call
.
To show uniform convergence, it suffices to show uniform Cauchyness, namely that for any , there is an
such that
implies
for all
. By compactness, open neighborhoods of all rationals of
, as an open cover, has a finite subcover. Each element of the subcover comes from some rational of
and across that finite subset of
we can for any
take the max of all the
s for convergence. This means that so long as our neighborhoods are sufficiently small, we can for any point
have some
that is the point of focus of the neighborhood of our finite subcover containing
and thereby connect
to
by equicontinuity and use our maximum
(over finite elements) to connect that to
and use equicontinuity again to connect that to
. Thus, triangle inequality over three of
suffices.
More explicitly, equicontinuity-wise, we have for every some open neighborhood of
such that
implies that
.