Fano's inequality - Alchetron, The Free Social Encyclopedia

In information theory, Fano's inequality (also known as the Fano converse and the Fano lemma) relates the average information lost in a noisy channel to the probability of the categorization error. It was derived by Robert Fano in the early 1950s while teaching a Ph.D. seminar in information theory at MIT, and later recorded in his 1961 textbook.

Alternative formulation

Let X be a random variable with density equal to one of r + 1 possible densities f 1 , … , f r + 1 . Furthermore, the Kullback–Leibler divergence between any pair of densities cannot be too large,

D K L ( f i ∥ f j ) ≤ β for all i ≠ j .

Let ψ ( X ) ∈ { 1 , … , r + 1 } be an estimate of the index. Then

sup i P i ( ψ ( X ) ≠ i ) ≥ 1 − β + log ⁡ 2 log ⁡ r

where P i is the probability induced by f i

Generalization

The following generalization is due to Ibragimov and Khasminskii (1979), Assouad and Birge (1983).

Let F be a class of densities with a subclass of r + 1 densities ƒ_θ such that for any θ ≠ θ′

∥ f θ − f θ ′ ∥ L 1 ≥ α , D K L ( f θ ∥ f θ ′ ) ≤ β .

Then in the worst case the expected value of error of estimation is bound from below,

sup f ∈ F E ∥ f n − f ∥ L 1 ≥ α 2 ( 1 − n β + log ⁡ 2 log ⁡ r )

where ƒ_n is any density estimator based on a sample of size n.

References

Fano's inequality Wikipedia

(Text) CC BY-SA

Contents

Alternative formulation

Generalization

References