Trisha Shetty (Editor)

Cantelli's inequality

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

In probability theory, Cantelli's inequality, named after Francesco Paolo Cantelli, is a generalization of Chebyshev's inequality in the case of a single "tail". The inequality states that

Pr ( X μ λ ) { σ 2 σ 2 + λ 2 if  λ > 0 , 1 σ 2 σ 2 + λ 2 if  λ < 0.

where

X is a real-valued random variable, Pr is the probability measure, μ is the expected value of X , σ 2 is the variance of X .

Combining the cases of λ > 0 and λ < 0 gives, for δ > 0 ,

Pr ( | X μ | δ ) 2 σ 2 σ 2 + δ 2 .

The inequality is due to Francesco Paolo Cantelli. The Chebyshev inequality implies that in any data sample or probability distribution, "nearly all" values are close to the mean in terms of the absolute value of the difference between the points of the data sample and the weighted average of the data sample. The Cantelli inequality (sometimes called the "Chebyshev–Cantelli inequality" or the "one-sided Chebyshev inequality") gives a way of estimating how the points of the data sample are bigger than or smaller than their weighted average without the two tails of the absolute value estimate. The Chebyshev inequality has "higher moments versions" and "vector versions", and so does the Cantelli inequality.

Proof

  • Case λ > 0 :
  • Let X be a real-valued random variable with finite variance σ 2 and expectation μ , and define Y = X μ (so that E [ Y ] = 0 and Var ( Y ) = σ 2 ).

    Then, for any u 0 , we have

    Pr ( X μ λ ) = Pr ( Y λ ) = Pr ( Y + u λ + u ) = Pr ( ( Y + u ) 2 ( λ + u ) 2 ) E [ ( Y + u ) 2 ] ( λ + u ) 2 = σ 2 + u 2 ( λ + u ) 2 .

    the last inequality being a consequence of Markov's inequality. As the above holds for any choice of u R , we can choose to apply it with the value that minimizes the function u 0 σ 2 + u 2 ( λ + u ) 2 . By differentiating, this can be seen to be u = σ 2 λ , leading to

    Pr ( X μ λ ) σ 2 + u 2 ( λ + u ) 2 = σ 2 λ 2 + σ 2 .
  • Case λ < 0 : we proceed as before, writing α = λ > 0 and for any u 0
  • Pr ( X μ < λ ) = Pr ( Y > α ) σ 2 α 2 + σ 2 = σ 2 λ 2 + σ 2

    using the previous derivation on Y . By taking the complement, we obtain

    Pr ( X μ λ ) 1 σ 2 λ 2 + σ 2 .

    References

    Cantelli's inequality Wikipedia