Cantelli's inequality - Alchetron, The Free Social Encyclopedia

In probability theory, Cantelli's inequality, named after Francesco Paolo Cantelli, is a generalization of Chebyshev's inequality in the case of a single "tail". The inequality states that

Pr ( X − μ ≥ λ ) { ≤ σ 2 σ 2 + λ 2 if λ > 0 , ≥ 1 − σ 2 σ 2 + λ 2 if λ < 0.

where

X is a real-valued random variable, Pr is the probability measure, μ is the expected value of X , σ 2 is the variance of X .

Combining the cases of λ > 0 and λ < 0 gives, for δ > 0 ,

Pr ( | X − μ | ≥ δ ) ≤ 2 σ 2 σ 2 + δ 2 .

The inequality is due to Francesco Paolo Cantelli. The Chebyshev inequality implies that in any data sample or probability distribution, "nearly all" values are close to the mean in terms of the absolute value of the difference between the points of the data sample and the weighted average of the data sample. The Cantelli inequality (sometimes called the "Chebyshev–Cantelli inequality" or the "one-sided Chebyshev inequality") gives a way of estimating how the points of the data sample are bigger than or smaller than their weighted average without the two tails of the absolute value estimate. The Chebyshev inequality has "higher moments versions" and "vector versions", and so does the Cantelli inequality.

Proof

Case λ > 0 :

Let X be a real-valued random variable with finite variance σ 2 and expectation μ , and define Y = X − μ (so that E [ Y ] = 0 and Var ⁡ ( Y ) = σ 2 ).

Then, for any u ≥ 0 , we have

Pr ( X − μ ≥ λ ) = Pr ( Y ≥ λ ) = Pr ( Y + u ≥ λ + u ) = Pr ( ( Y + u ) 2 ≥ ( λ + u ) 2 ) ≤ E [ ( Y + u ) 2 ] ( λ + u ) 2 = σ 2 + u 2 ( λ + u ) 2 .

the last inequality being a consequence of Markov's inequality. As the above holds for any choice of u ∈ R , we can choose to apply it with the value that minimizes the function u ≥ 0 ↦ σ 2 + u 2 ( λ + u ) 2 . By differentiating, this can be seen to be u ∗ = σ 2 λ , leading to

Pr ( X − μ ≥ λ ) ≤ σ 2 + u ∗ 2 ( λ + u ∗ ) 2 = σ 2 λ 2 + σ 2 .

Case λ < 0 : we proceed as before, writing α = − λ > 0 and for any u ≥ 0

Pr ( X − μ < λ ) = Pr ( − Y > α ) ≤ σ 2 α 2 + σ 2 = σ 2 λ 2 + σ 2

using the previous derivation on − Y . By taking the complement, we obtain

Pr ( X − μ ≥ λ ) ≥ 1 − σ 2 λ 2 + σ 2 .

References

Cantelli's inequality Wikipedia

(Text) CC BY-SA