In probability theory, concentration inequalities provide bounds on how a random variable deviates from some value (typically, its expected value). The laws of large numbers of classical probability theory state that sums of independent random variables are, under very mild conditions, close to their expectation with a large probability. Such sums are the most basic examples of random variables concentrated around their mean. Recent results shows that such behavior is shared by other functions of independent random variables.
Contents
- Markovs inequality
- Chebyshevs inequality
- Chernoff bounds
- Bounds on sums of independent variables
- EfronStein inequality
- DvoretzkyKieferWolfowitz inequality
- References
Concentration inequalities can be sorted according to how much information about the random variable is needed in order to use them.
Markov's inequality
Markov's inequality requires only the following information on a random variable X:
Then, for every constant
Markov's inequality extends to a strictly increasing and non-negative function
Chebyshev's inequality
Chebyshev's inequality requires the following information on a random variable X:
Then, for every constant a > 0:
or equivalently:
Chebyshev's inequality can be seen as a special case of the generalized Markov's inequality when
Chernoff bounds
The generic Chernoff bound requires only the moment generating function of X, defined as:
and for every
There are various Chernoff bounds for different distributions and different values of the parameter
Bounds on sums of independent variables
Let
Let
It is often interesting to bound the difference between the sum and its expected value. Several inequalities can be used.
1. Hoeffding's inequality says that:
2. The random variable
This is a generalization of Hoeffding's since it can handle other types of martingales, as well as supermartingales and submartingales.
3. The sum function,
This is a different generalization of Hoeffding's since it can handle other functions besides the sum function, as long as they change in a bounded way.
4. Bennett's inequality offers some improvement over Hoeffding's when the variances of the summands are small compared to their almost-sure bounds C. It says that:
5. The first of Bernstein's inequalities says that:
This is a generalization of Hoeffding's since it can handle not only independent variables but also weakly-dependent variables.
6. Chernoff bounds have a particularly simple form in the case of sum of independent variables, since
For example, suppose the variables
If
If
7. Similar bounds can be found in: Rademacher distribution#Bounds on sums
Efron–Stein inequality
The Efron–Stein inequality (or influence inequality, or MG bound on variance) bounds the variance of a general function.
Suppose that
Let
Dvoretzky–Kiefer–Wolfowitz inequality
The Dvoretzky–Kiefer–Wolfowitz inequality bounds the difference between the real and the empirical cumulative distribution function.
Given a natural number n, let X1, X2, …, Xn be real-valued independent and identically distributed random variables with cumulative distribution function F(·). Let Fn denote the associated empirical distribution function defined by
So
Then: