Rahul Sharma (Editor)

Borel–Cantelli lemma

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

In probability theory, the Borel–Cantelli lemma is a theorem about sequences of events. In general, it is a result in measure theory. It is named after Émile Borel and Francesco Paolo Cantelli, who gave statement to the lemma in the first decades of the 20th century. A related result, sometimes called the second Borel–Cantelli lemma, is a partial converse of the first Borel–Cantelli lemma. The lemma states that, under certain conditions, an event will have probability of either zero or one. As such, it is the best-known of a class of similar theorems, known as zero-one laws. Other examples include the Kolmogorov 0-1 law and the Hewitt–Savage zero-one law.

Contents

Statement of lemma for probability spaces

Let E1,E2,... be a sequence of events in some probability space. The Borel–Cantelli lemma states:

If the sum of the probabilities of the En is finite then the probability that infinitely many of them occur is 0, that is,

Here, "lim sup" denotes limit supremum of the sequence of events, and each event is a set of outcomes. That is, lim sup En is the set of outcomes that occur infinitely many times within the infinite sequence of events (En). Explicitly,

lim sup n E n = n = 1 k n E k .

The theorem therefore asserts that if the sum of the probabilities of the events En is finite, then the set of all outcomes that are "repeated" infinitely many times must occur with probability zero. Note that no assumption of independence is required.

Example

Suppose (Xn) is a sequence of random variables with Pr(Xn = 0) = 1/n2 for each n. The probability that Xn = 0 occurs for infinitely many n is equivalent to the probability of the intersection of infinitely many [Xn = 0] events. The intersection of infinitely many such events is a set of outcomes common to all of them. However, the sum ∑Pr(Xn = 0) converges to π2/6 ≈ 1.645 < ∞, and so the Borel–Cantelli Lemma states that the set of outcomes that are common to infinitely many such events occurs with probability zero. Hence, the probability of Xn = 0 occurring for infinitely many n is 0. Almost surely (i.e., with probability 1), Xn is nonzero for all but finitely many n.

Proof

Let [ E n ] denote the indicator function of the event E n (using Iverson bracket notation). Then, by the linearity of expectation

E ( n [ E n ] ) = n E ( [ E n ] ) = n Pr ( E n ) <

by hypothesis. This directly implies that

Pr ( n [ E n ] = ) = 0

because otherwise

E ( n [ E n ] ) { n [ E n ] = } ( n [ E n ] ) d Pr = .

Alternative proof

Let (En) be a sequence of events in some probability space and suppose that the sum of the probabilities of the En is finite. That is suppose:

n = 1 Pr ( E n ) < .

Now we can examine the series by examining the elements in the series. We can order the sequence such that the smaller the element is, the later it would come in the sequence. That is :

Pr ( E i ) Pr ( E i + 1 ) .

As the series converges, we must have that

n = N Pr ( E n ) 0 , as  N .

Therefore :

inf N 1 n = N Pr ( E n ) = 0.

Therefore it follows that

Pr ( lim sup n E n ) = Pr ( infinitely many of the  E n  occur ) = Pr ( N = 1 n = N E n ) inf N 1 Pr ( n = N E n ) inf N 1 n = N Pr ( E n ) = 0

General measure spaces

For general measure spaces, the Borel–Cantelli lemma takes the following form:

Let μ be a (positive) measure on a set X, with σ-algebra F, and let (An) be a sequence in F. If then

Converse result

A related result, sometimes called the second Borel–Cantelli lemma, is a partial converse of the first Borel–Cantelli lemma. The lemma states: If the events En are independent and the sum of the probabilities of the En diverges to infinity, then the probability that infinitely many of them occur is 1. That is:

The assumption of independence can be weakened to pairwise independence, but in that case the proof is more difficult.

Example

The infinite monkey theorem is a special case of this lemma.

The lemma can be applied to give a covering theorem in Rn. Specifically (Stein 1993, Lemma X.2.1), if Ej is a collection of Lebesgue measurable subsets of a compact set in Rn such that

j μ ( E j ) = ,

then there is a sequence Fj of translates

F j = E j + x j

such that

lim sup F j = n = 1 k = n F k = R n

apart from a set of measure zero.

Proof

Suppose that n = 1 Pr ( E n ) = and the events ( E n ) n = 1 are independent. It is sufficient to show the event that the En's did not occur for infinitely many values of n has probability 0. This is just to say that it is sufficient to show that

1 Pr ( lim sup n E n ) = 0.

Noting that:

1 Pr ( lim sup n E n ) = 1 Pr ( { E n  i.o. } ) = Pr ( { E n  i.o. } c ) = Pr ( ( N = 1 n = N E n ) c ) = Pr ( N = 1 n = N E n c ) = Pr ( lim inf n E n c ) = lim N Pr ( n = N E n c )

it is enough to show: Pr ( n = N E n c ) = 0 . Since the ( E n ) n = 1 are independent:

Pr ( n = N E n c ) = n = N Pr ( E n c ) = n = N ( 1 Pr ( E n ) ) n = N exp ( Pr ( E n ) ) = exp ( n = N Pr ( E n ) ) = 0.

This completes the proof. Alternatively, we can see Pr ( n = N E n c ) = 0 by taking negative the logarithm of both sides to get:

log ( Pr ( n = N E n c ) ) = log ( n = N ( 1 Pr ( E n ) ) ) = n = N log ( 1 Pr ( E n ) ) .

Since −log(1 − x) ≥ x for all x > 0, the result similarly follows from our assumption that n = 1 Pr ( E n ) = .

Counterpart

Another related result is the so-called counterpart of the Borel–Cantelli lemma. It is a counterpart of the Lemma in the sense that it gives a necessary and sufficient condition for the limsup to be 1 by replacing the independence assumption by the completely different assumption that ( A n ) is monotone increasing for sufficiently large indices. This Lemma says:

Let ( A n ) be such that A k A k + 1 , and let A ¯ denote the complement of A . Then the probability of infinitely many A k occur (that is, at least one A k occurs) is one if and only if there exists a strictly increasing sequence of positive integers ( t k ) such that

k Pr ( A t k + 1 A ¯ t k ) = .

This simple result can be useful in problems such as for instance those involving hitting probabilities for stochastic process with the choice of the sequence ( t k ) usually being the essence.

References

Borel–Cantelli lemma Wikipedia