Suvarna Garge (Editor)

Bernstein inequalities (probability theory)

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

In probability theory, Bernstein inequalities give bounds on the probability that the sum of random variables deviates from its mean. In the simplest case, let X1, ..., Xn be independent Bernoulli random variables taking values +1 and −1 with probability 1/2 (this distribution is also known as the Rademacher distribution), then for every positive ε ,

Contents

P ( | 1 n i = 1 n X i | > ε ) 2 exp ( n ε 2 2 ( 1 + ε 3 ) ) .

Bernstein inequalities were proved and published by Sergei Bernstein in the 1920s and 1930s. Later, these inequalities were rediscovered several times in various forms. Thus, special cases of the Bernstein inequalities are also known as the Chernoff bound, Hoeffding's inequality and Azuma's inequality.

Some of the inequalities

1. Let X 1 , , X n be independent zero-mean random variables. Suppose that | X i | M almost surely, for all i . Then, for all positive t ,

P ( i = 1 n X i > t ) exp ( 1 2 t 2 E [ X j 2 ] + 1 3 M t ) .

2. Let X 1 , , X n be independent random variables. Suppose that for some positive real L and every integer k > 1 ,

E [ | X i k | ] 1 2 E [ X i 2 ] L k 2 k !

Then

P ( i = 1 n X i 2 t E [ X i 2 ] ) < exp ( t 2 ) , for  0 < t 1 2 L E [ X j 2 ] .

3. Let X 1 , , X n be independent random variables. Suppose that

E [ | X i k | ] k ! 4 ! ( L 5 ) k 4

for all integer k > 3 . Denote

A k = E [ X i k ] .

Then,

P ( | j = 1 n X j A 3 t 2 3 A 2 | 2 A 2 t [ 1 + A 4 t 2 6 A 2 2 ] ) < 2 exp ( t 2 ) , for  0 < t 5 2 A 2 4 L .

4. Bernstein also proved generalizations of the inequalities above to weakly dependent random variables. For example, inequality (2) can be extended as follows. X 1 , , X n be possibly non-independent random variables. Suppose that for all integer i > 0 ,

E [ X i | X 1 , , X i 1 ] = 0 , E [ X i 2 | X 1 , , X i 1 ] R i E [ X i 2 ] , E [ X i k | X 1 , , X i 1 ] 1 2 E [ X i 2 | X 1 , , X i 1 ] L k 2 k !

Then

P ( i = 1 n X i 2 t i = 1 n R i E [ X i 2 ] ) < exp ( t 2 ) , for  0 < t 1 2 L i = 1 n R i E [ X i 2 ] .

More general results for martingales can be found in Fan et al. (2015).

Proofs

The proofs are based on an application of Markov's inequality to the random variable

exp ( λ j = 1 n X j ) ,

for a suitable choice of the parameter λ > 0 .

References

Bernstein inequalities (probability theory) Wikipedia