F divergence - Alchetron, The Free Social Encyclopedia

In probability theory, an ƒ-divergence is a function D_f (P || Q) that measures the difference between two probability distributions P and Q. It helps the intuition to think of the divergence as an average, weighted by the function f, of the odds ratio given by P and Q.

Definition

Let P and Q be two probability distributions over a space Ω such that P is absolutely continuous with respect to Q. Then, for a convex function f such that f(1) = 0, the f-divergence of Q from P is defined as

D f ( P ∥ Q ) ≡ ∫ Ω f ( d P d Q ) d Q .

If P and Q are both absolutely continuous with respect to a reference distribution μ on Ω then their probability densities p and q satisfy dP = p dμ and dQ = q dμ. In this case the f-divergence can be written as

D f ( P ∥ Q ) = ∫ Ω f ( p ( x ) q ( x ) ) q ( x ) d μ ( x ) .

The f-divergences can be expressed using Taylor series and rewritten using a weighted sum of chi-type distances (Nielsen & Nock (2013)).

Instances of f-divergences

Many common divergences, such as KL-divergence, Hellinger distance, and total variation distance, are special cases of f-divergence, coinciding with a particular choice of f. The following table lists many of the common divergences between probability distributions and the f function to which they correspond (cf. Liese & Vajda (2006)).

References

F-divergence Wikipedia

(Text) CC BY-SA

Contents

Definition

Instances of f-divergences

References