Laplace distribution - Alchetron, The Free Social Encyclopedia



Parameters μ {displaystyle mu } location (real) b > 0 {displaystyle b>0} scale (real) Support x ∈ ( − ∞ ; + ∞ ) {displaystyle xin (-infty ;+infty ),} PDF 1 2 b exp ⁡ ( − \| x − μ \| b ) {displaystyle {rac {1}{2,b}}exp left(-{rac {\|x-mu \|}{b}}ight),} CDF { 1 2 exp ⁡ ( x − μ b ) if x < μ 1 − 1 2 exp ⁡ ( − x − μ b ) if x ≥ μ {displaystyle {egin{cases}{rac {1}{2}}exp left({rac {x-mu }{b}}ight)&{mbox{if }}x<mu [8pt]1-{rac {1}{2}}exp left(-{rac {x-mu }{b}}ight)&{mbox{if }}xgeq mu end{cases}}} Mean μ {displaystyle mu } Median μ {displaystyle mu }

In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponential distributions (with an additional location parameter) spliced together back-to-back, although the term 'double exponential distribution' is also sometimes used to refer to the Gumbel distribution. The difference between two independent identically distributed exponential random variables is governed by a Laplace distribution, as is a Brownian motion evaluated at an exponentially distributed random time. Increments of Laplace motion or a variance gamma process evaluated over the time scale also have a Laplace distribution.

Probability density function

A random variable has a Laplace(μ, b) distribution if its probability density function is

f ( x ∣ μ , b ) = 1 2 b exp ⁡ ( − | x − μ | b ) = 1 2 b { exp ⁡ ( − μ − x b ) if x < μ exp ⁡ ( − x − μ b ) if x ≥ μ

Here, μ is a location parameter and b > 0, which is sometimes referred to as the diversity, is a scale parameter. If μ = 0 and b = 1, the positive half-line is exactly an exponential distribution scaled by 1/2.

The probability density function of the Laplace distribution is also reminiscent of the normal distribution; however, whereas the normal distribution is expressed in terms of the squared difference from the mean μ, the Laplace density is expressed in terms of the absolute difference from the mean. Consequently, the Laplace distribution has fatter tails than the normal distribution.

Differential equation

The pdf of the Laplace distribution is a solution of the following differential equation:

{ { b f ′ ( x ) + f ( x ) = 0 f ( 0 ) = e μ b 2 b } if x ≥ μ { b f ′ ( x ) − f ( x ) = 0 f ( 0 ) = e − μ b 2 b } if x < μ

Cumulative distribution function

The Laplace distribution is easy to integrate (if one distinguishes two symmetric cases) due to the use of the absolute value function. Its cumulative distribution function is as follows:

F ( x ) = ∫ − ∞ x f ( u ) d u = { 1 2 exp ⁡ ( x − μ b ) if x < μ 1 − 1 2 exp ⁡ ( − x − μ b ) if x ≥ μ = 1 2 + 1 2 sgn ⁡ ( x − μ ) ( 1 − exp ⁡ ( − | x − μ | b ) ) .

The inverse cumulative distribution function is given by

F − 1 ( p ) = μ − b sgn ⁡ ( p − 0.5 ) ln ⁡ ( 1 − 2 | p − 0.5 | ) .

Generating random variables according to the Laplace distribution

Given a random variable U drawn from the uniform distribution in the interval (−1/2, 1/2], the random variable

X = μ − b sgn ⁡ ( U ) ln ⁡ ( 1 − 2 | U | )

has a Laplace distribution with parameters μ and b. This follows from the inverse cumulative distribution function given above.

A Laplace(0, b) variate can also be generated as the difference of two i.i.d. Exponential(1/b) random variables. Equivalently, Laplace(0,1) can also be generated as the logarithm of the ratio of two i.i.d. uniform random variables.

Parameter estimation

Given N independent and identically distributed samples x₁, x₂, ..., x_N, the maximum likelihood estimator μ ^ of μ is the sample median, and the maximum likelihood estimator of b is

b ^ = 1 N ∑ i = 1 N | x i − μ ^ |

(revealing a link between the Laplace distribution and least absolute deviations).

Moments

μ r ′ = ( 1 2 ) ∑ k = 0 r [ r ! ( r − k ) ! b k μ ( r − k ) { 1 + ( − 1 ) k } ] = m n + 1 2 b ( e m / b E − n ( m / b ) − e − m / b E − n ( − m / b ) )

where E n ( ) is the generalized exponential integral function E n ( x ) = x n − 1 Γ ( 1 − n , x ) .

If X ~ Laplace(μ, b) then kX + c ~ Laplace(kμ + c, kb).

If X ~ Laplace(0, b) then |X| ~ Exponential(b⁻¹).

If X, Y ~ Exponential(λ) then X − Y ~ Laplace(0, λ⁻¹) ．

If X ~ Laplace(μ, b) then |X − μ| ~ Exponential(b⁻¹).

If X ~ Laplace(μ, b) then X ~ EPD(μ, b, 0).

If X₁, ... X₄ ~ N(0, 1) then X₁X₂ − X₃X₄ ~ Laplace(0, 1).

If X_i ~ Laplace(μ, b) then 2 b ∑ i = 1 n | X i − μ | ∼ χ 2 ( 2 n ) (Chi-squared distribution)

If X, Y ~ Laplace(μ, b) then | X − μ | | Y − μ | ∼ F ⁡ ( 2 , 2 ) (F-distribution)

If X, Y ~ U(0, 1) then log(X/Y) ~ Laplace(0, 1).

If X ~ Exponential(λ) and Y ~ Bernoulli(0.5) independent of X, then X(2Y − 1) ~ Laplace(0, λ⁻¹).

If X ~ Exponential(λ) and Y ~ Exponential(ν) independent of X, then λX − νY ~ Laplace(0, 1) ．

If X has a Rademacher distribution and Y ~ Exp(λ) then XY ~ Laplace(0, 1/λ)

If V ~ Exponential(1) and Z ~ N(0, 1) independent of V, then X = μ + b 2 V Z ∼ L a p l a c e ( μ , b ) .

If X ~ GeometricStable(2, 0, λ, 0) then X ~ Laplace(0, λ).

The Laplace distribution is a limiting case of the hyperbolic distribution.

If X|Y ~ Normal(μ, σ = Y) with Y ~ Rayleigh(b) then X ~ Laplace(μ, b).

Relation to the exponential distribution

A Laplace random variable can be represented as the difference of two iid exponential random variables. One way to show this is by using the characteristic function approach. For any set of independent continuous random variables, for any linear combination of those variables, its characteristic function (which uniquely determines the distribution) can be acquired by multiplying the corresponding characteristic functions.

Consider two i.i.d random variables X, Y ~ Exponential(λ). The characteristic functions for X, −Y are

λ − i t + λ , λ i t + λ

respectively. On multiplying these characteristic functions (equivalent to the characteristic function of the sum of therandom variables X + (−Y)), the result is

λ 2 ( − i t + λ ) ( i t + λ ) = λ 2 t 2 + λ 2 .

This is the same as the characteristic function for Z ~ Laplace(0,1/λ), which is

1 1 + t 2 λ 2 .

Sargan distributions

Sargan distributions are a system of distributions of which the Laplace distribution is a core member. A pth order Sargan distribution has density

f p ( x ) = 1 2 exp ⁡ ( − α | x | ) 1 + ∑ j = 1 p β j α j | x | j 1 + ∑ j = 1 p j ! β j ,

for parameters α ≥ 0, β_j ≥ 0. The Laplace distribution results for p = 0.

Applications

The Laplacian distribution has been used in speech recognition to model priors on DFT coefficients and in JPEG image compression to model AC coefficients generated by a DCT.

The addition of noise drawn from a Laplacian distribution, with scaling parameter appropriate to a function's sensitivity, to the output of a statistical database query is the most common means to provide differential privacy in statistical databases.

In regression analysis, the least absolute deviations estimate arises as the maximum likelihood estimate if the errors have a Laplace distribution.

The Lasso can be thought of as a bayesian regression with a Laplacian prior.

History

This distribution is often referred to as Laplace's first law of errors. He published it in 1774 when he noted that the frequency of an error could be expressed as an exponential function of its magnitude once its sign was disregarded.

Keynes published a paper in 1911 based on his earlier thesis wherein he showed that the Laplace distribution minimised the absolute deviation from the median.

References

Laplace distribution Wikipedia

(Text) CC BY-SA

Contents