Trisha Shetty (Editor)

Gibbs' inequality

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Gibbs' inequality

In information theory, Gibbs' inequality is a statement about the mathematical entropy of a discrete probability distribution. Several other bounds on the entropy of probability distributions are derived from Gibbs' inequality, including Fano's inequality. It was first presented by J. Willard Gibbs in the 19th century.

Contents

Gibbs' inequality

Suppose that

P = { p 1 , , p n }

is a probability distribution. Then for any other probability distribution

Q = { q 1 , , q n }

the following inequality between positive quantities (since the pi and qi are positive numbers less than one) holds

i = 1 n p i log 2 p i i = 1 n p i log 2 q i

with equality if and only if

p i = q i

for all i. Put in words, the information entropy of a distribution P is less than or equal to its cross entropy with any other distribution Q.

The difference between the two quantities is the Kullback–Leibler divergence or relative entropy, so the inequality can also be written:

D K L ( P Q ) i = 1 n p i log 2 p i q i 0.

Note that the use of base-2 logarithms is optional, and allows one to refer to the quantity on each side of the inequality as an "average surprisal" measured in bits.

Proof

Since

log 2 a = ln a ln 2

it is sufficient to prove the statement using the natural logarithm (ln). Note that the natural logarithm satisfies

ln x x 1

for all x > 0 with equality if and only if x=1.

Let I denote the set of all i for which pi is non-zero. Then

i I p i ln q i p i i I p i ( q i p i 1 ) = i I q i + i I p i 0.

So

i I p i ln q i i I p i ln p i

and then trivially

i = 1 n p i ln q i i = 1 n p i ln p i

since the right hand side does not grow, but the left hand side may grow or may stay the same.

For equality to hold, we require:

  1. q i p i = 1 for all i I so that the approximation ln q i p i = q i p i 1 is exact.
  2. i I q i = 1 so that equality continues to hold between the third and fourth lines of the proof.

This can happen if and only if

p i = q i

for i = 1, ..., n.

Alternative proofs

The result can alternatively be proved using Jensen's inequality or log sum inequality.

Corollary

The entropy of P is bounded by:

H ( p 1 , , p n ) log n .

The proof is trivial - simply set q i = 1 / n for all i.

References

Gibbs' inequality Wikipedia