Parameters λ > 0 (real) | ||
![]() | ||
Support k ∈ N ∪ 0 {displaystyle kin mathbb {N} cup 0} ; pmf λ k e − λ k ! {displaystyle {rac {lambda ^{k}e^{-lambda }}{k!}}} CDF Γ ( ⌊ k + 1 ⌋ , λ ) ⌊ k ⌋ ! {displaystyle {rac {Gamma (lfloor k+1floor ,lambda )}{lfloor kfloor !}}} , or e − λ ∑ i = 0 ⌊ k ⌋ λ i i ! {displaystyle e^{-lambda }sum _{i=0}^{lfloor kfloor }{rac {lambda ^{i}}{i!}} } , or Q ( ⌊ k + 1 ⌋ , λ ) {displaystyle Q(lfloor k+1floor ,lambda )} (for k ≥ 0 {displaystyle kgeq 0} , where Γ ( x , y ) {displaystyle Gamma (x,y)} is the incomplete gamma function, ⌊ k ⌋ {displaystyle lfloor kfloor } is the floor function, and Q is the regularized gamma function) Mean λ {displaystyle lambda } Median ≈ ⌊ λ + 1 / 3 − 0.02 / λ ⌋ {displaystyle approx lfloor lambda +1/3-0.02/lambda floor } |
In probability theory and statistics, the Poisson distribution (French pronunciation [pwasɔ̃]; in English usually /ˈpwɑːsɒn/), named after French mathematician Siméon Denis Poisson, is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time and/or space if these events occur with a known average rate and independently of the time since the last event. The Poisson distribution can also be used for the number of events in other specified intervals such as distance, area or volume.
Contents
- Introduction to the Poisson distribution
- Examples
- Assumptions When is the Poisson distribution an appropriate model
- Probability of events for a Poisson distribution
- Examples of probability for Poisson distributions
- Once in an interval events The special case of 1 and k 0
- Examples that violate the Poisson assumptions
- Poisson regression and negative binomial regression
- History
- Definition
- Mean
- Median
- Higher moments
- Sums of Poisson distributed random variables
- Other properties
- Poisson races
- Related distributions
- Occurrence
- Law of rare events
- Poisson point process
- Other applications in science
- Generating Poisson distributed random variables
- Maximum likelihood
- Confidence interval
- Bayesian inference
- Simultaneous estimation of multiple Poisson means
- Bivariate Poisson distribution
- Poisson distribution using R
- Poisson distribution using Excel
- Poisson distribution using Python SciPy
- Poisson distribution using Mathematica
- References
For instance, an individual keeping track of the amount of mail they receive each day may notice that they receive an average number of 4 letters per day. If receiving any particular piece of mail doesn't affect the arrival times of future pieces of mail, i.e., if pieces of mail from a wide range of sources arrive independently of one another, then a reasonable assumption is that the number of pieces of mail received per day obeys a Poisson distribution. Other examples that may follow a Poisson: the number of phone calls received by a call center per hour or the number of decay events per second from a radioactive source.
Introduction to the Poisson distribution
The Poisson distribution is popular for modelling the number of times an event occurs in an interval of time or space.
Examples
The Poisson distribution may be useful to model events such as
Assumptions: When is the Poisson distribution an appropriate model?
The Poisson distribution is an appropriate model if the following assumptions are true.
If these conditions are true, then K is a Poisson random variable, and the distribution of K is a Poisson distribution.
Probability of events for a Poisson distribution
An event can occur 0, 1, 2, … times in an interval. The average number of events in an interval is designated
where
This equation is the probability mass function (PMF) for a Poisson distribution.
Examples of probability for Poisson distributions
On a particular river, overflow floods occur once every 100 years on average. Calculate the probability of k = 0, 1, 2, 3, 4, 5, or 6 overflow floods in a 100-year interval, assuming the Poisson model is appropriate.
Because the average event rate is one overflow flood per 100 years, λ = 1
The table below gives the probability for 0 to 6 overflow floods in a 100-year period.
Ugarte and colleagues report that the average number of goals in a World Cup soccer match is approximately 2.5 and the Poisson model is appropriate.
Because the average event rate is 2.5 goals per match, λ = 2.5.
The table below gives the probability for 0 to 7 goals in a match.
Once in an interval events: The special case of λ = 1 and k = 0
Suppose that astronomers estimate that large meteors (above a certain size) hit the earth on average once every 100 years (λ = 1 event per 100 years), and that the number of meteor hits follows a Poisson distribution. What is the probability of k = 0 meteor hits in the next 100 years?
Under these assumptions, the probability that no large meteors hit the earth in the next 100 years is p = 0.37. The remaining 1 – 0.37 = 0.63 is the probability of 1, 2, 3, or more large meteor hits in the next 100 years. In an example above, an overflow flood occurred once every 100 years (λ = 1). The probability of no overflow floods in 100 years was p = 0.37, by the same calculation.
In general, if an event occurs once per interval (λ = 1), and the events follow a Poisson distribution, then P(k = 0 events in next interval) = 0.37.
As it happens, P(exactly one event in next interval) = 0.37, as shown in the table for overflow floods.
Examples that violate the Poisson assumptions
The number of students who arrive at the student union per minute will likely not follow a Poisson distribution, because the rate is not constant (low rate during class time, high rate between class times) and the arrivals of individual students are not independent (students tend to come in groups).
The number of magnitude 5 earthquakes per year in California may not follow a Poisson distribution if one large earthquake increases the probability of aftershocks of similar magnitude.
Among patients admitted to the intensive care unit of a hospital, the number of days that the patients spend in the ICU is not Poisson distributed because the number of days cannot be zero. The distribution may be modeled using a Zero-truncated Poisson distribution.
Count distributions in which the number of intervals with zero events is higher than predicted by a Poisson model may be modeled using a Zero-inflated model.
Poisson regression and negative binomial regression
Poisson regression and negative binomial regression are useful for analyses where the dependent (response) variable is the count (0, 1, 2, …) of the number of events or occurrences in an interval.
History
The distribution was first introduced by Siméon Denis Poisson (1781–1840) and published, together with his probability theory, in 1837 in his work Recherches sur la probabilité des jugements en matière criminelle et en matière civile ("Research on the Probability of Judgments in Criminal and Civil Matters"). The work theorized about the number of wrongful convictions in a given country by focusing on certain random variables N that count, among other things, the number of discrete occurrences (sometimes called "events" or "arrivals") that take place during a time-interval of given length. The result had been given previously by Abraham de Moivre (1711) in De Mensura Sortis seu; de Probabilitate Eventuum in Ludis a Casu Fortuito Pendentibus in Philosophical Transactions of the Royal Society, p. 219. This makes it an example of Stigler's law and it has prompted some authors to argue that the Poisson distribution should bear the name of de Moivre.
A practical application of this distribution was made by Ladislaus Bortkiewicz in 1898 when he was given the task of investigating the number of soldiers in the Prussian army killed accidentally by horse kicks; this experiment introduced the Poisson distribution to the field of reliability engineering.
Definition
A discrete random variable X is said to have a Poisson distribution with parameter λ > 0, if, for k = 0, 1, 2, ..., the probability mass function of X is given by:
where
The positive real number λ is equal to the expected value of X and also to its variance
The Poisson distribution can be applied to systems with a large number of possible events, each of which is rare. How many such events will occur during a fixed time interval? Under the right circumstances, this is a random number with a Poisson distribution.
The conventional definition of the Poisson distribution contains two terms that can easily overflow on computers: λk and k!. The fraction of λk to k! can also produce a rounding error which is very large compared to e−λ, and therefore give an erroneous result. For numerical stability the Poisson probability mass function should therefore be evaluated as
which is mathematically equivalent but numerically stable. The natural logarithm of the Gamma function can be obtained using the lgamma
function in the C (programming language) standard library (C99 version), the gammaln
function in MATLAB or SciPy, or the log_gamma
function in Fortran 2008 and later.
Mean
Median
Bounds for the median (ν) of the distribution are known and are sharp:
Higher moments
Sums of Poisson-distributed random variables
IfOther properties
Poisson races
Let
The upper bound is proved using a standard Chernoff bound.
The lower bound can be proved by noting that
Related distributions
Occurrence
Applications of the Poisson distribution can be found in many fields related to counting:
The Poisson distribution arises in connection with Poisson processes. It applies to various phenomena of discrete properties (that is, those that may happen 0, 1, 2, 3, ... times during a given period of time or in a given area) whenever the probability of the phenomenon happening is constant in time or space. Examples of events that may be modelled as a Poisson distribution include:
Gallagher in 1976 showed that the counts of prime numbers in short intervals obey a Poisson distribution provided a certain version of an unproved conjecture of Hardy and Littlewood is true.
Law of rare events
The rate of an event is related to the probability of an event occurring in some small subinterval (of time, space or otherwise). In the case of the Poisson distribution, one assumes that there exists a small enough subinterval for which the probability of an event occurring twice is "negligible". With this assumption one can derive the Poisson distribution from the Binomial one, given only the information of expected number of total events in the whole interval. Let this total number be
In several of the above examples—such as, the number of mutations in a given sequence of DNA—the events being counted are actually the outcomes of discrete trials, and would more precisely be modelled using the binomial distribution, that is
In such cases n is very large and p is very small (and so the expectation np is of intermediate magnitude). Then the distribution may be approximated by the less cumbersome Poisson distribution
This approximation is sometimes known as the law of rare events, since each of the n individual Bernoulli events rarely occurs. The name may be misleading because the total count of success events in a Poisson process need not be rare if the parameter np is not small. For example, the number of telephone calls to a busy switchboard in one hour follows a Poisson distribution with the events appearing frequent to the operator, but they are rare from the point of view of the average member of the population who is very unlikely to make a call to that switchboard in that hour.
The word law is sometimes used as a synonym of probability distribution, and convergence in law means convergence in distribution. Accordingly, the Poisson distribution is sometimes called the law of small numbers because it is the probability distribution of the number of occurrences of an event that happens rarely but has very many opportunities to happen. The Law of Small Numbers is a book by Ladislaus Bortkiewicz (Bortkevitch) about the Poisson distribution, published in 1898.
Poisson point process
The Poisson distribution arises as the number of points of a Poisson point process located in some finite region. More specifically, if D is some region space, for example Euclidean space Rd, for which |D|, the area, volume or, more generally, the Lebesgue measure of the region is finite, and if N(D) denotes the number of points in D, then
Other applications in science
In a Poisson process, the number of observed occurrences fluctuates about its mean λ with a standard deviation
The correlation of the mean and standard deviation in counting independent discrete occurrences is useful scientifically. By monitoring how the fluctuations vary with the mean signal, one can estimate the contribution of a single occurrence, even if that contribution is too small to be detected directly. For example, the charge e on an electron can be estimated by correlating the magnitude of an electric current with its shot noise. If N electrons pass a point in a given time t on the average, the mean current is
An everyday example is the graininess that appears as photographs are enlarged; the graininess is due to Poisson fluctuations in the number of reduced silver grains, not to the individual grains themselves. By correlating the graininess with the degree of enlargement, one can estimate the contribution of an individual grain (which is otherwise too small to be seen unaided). Many other molecular applications of Poisson noise have been developed, e.g., estimating the number density of receptor molecules in a cell membrane.
In Causal Set theory the discrete elements of spacetime follow a Poisson distribution in the volume.
Generating Poisson-distributed random variables
A simple algorithm to generate random Poisson-distributed numbers (pseudo-random number sampling) has been given by Knuth (see References below):
algorithm poisson random number (Knuth): init: Let L ← e−λ, k ← 0 and p ← 1. do: k ← k + 1. Generate uniform random number u in [0,1] and let p ← p × u. while p > L. return k − 1.While simple, the complexity is linear in the returned value k, which is λ on average. There are many other algorithms to overcome this. Some are given in Ahrens & Dieter, see References below. Also, for large values of λ, there may be numerical stability issues because of the term e−λ. This could be solved by a slight change to allow λ to be added into the calculation gradually:
algorithm poisson random number (Junhao, based on Knuth): init: Let λLeft ← λ, k ← 0 and p ← 1. do: k ← k + 1. Generate uniform random number u in (0,1) and let p ← p × u. if p < e and λLeft > 0: if λLeft > STEP: p ← p × eSTEP λLeft ← λLeft - STEP else: p ← p × eλLeft λLeft ← -1 while p > 1. return k − 1.The choice of STEP depends on the threshold of overflow. For double precision floating point format, the threshold is near e700, so 500 shall be a safe STEP.
Other solutions for large values of λ include rejection sampling and using Gaussian approximation.
Inverse transform sampling is simple and efficient for small values of λ, and requires only one uniform random number u per sample. Cumulative probabilities are examined in turn until one exceeds u.
algorithm Poisson generator based upon the inversion by sequential search: init: Let x ← 0, p ← e−λ, s ← p. Generate uniform random number u in [0,1]. while u > s do: x ← x + 1. p ← p * λ / x. s ← s + p. return x."This algorithm ... requires expected time proportional to λ as λ→∞. For large λ, round-off errors proliferate, which provides us with another reason for avoiding large values of λ."
Maximum likelihood
Given a sample of n measured values ki = 0, 1, 2, ..., for i = 1, ..., n, we wish to estimate the value of the parameter λ of the Poisson population from which the sample was drawn. The maximum likelihood estimate is
Since each observation has expectation λ so does this sample mean. Therefore, the maximum likelihood estimate is an unbiased estimator of λ. It is also an efficient estimator, i.e. its estimation variance achieves the Cramér–Rao lower bound (CRLB). Hence it is minimum-variance unbiased. Also it can be proved that the sum (and hence the sample mean as it is a one-to-one function of the sum) is a complete and sufficient statistic for λ.
To prove sufficiency we may use the factorization theorem. Consider partitioning the probability mass function of the joint Poisson distribution for the sample into two parts: one that depends solely on the sample
Note that the first term,
To find the parameter λ that maximizes the probability function for the Poisson population, we can use the logarithm of the likelihood function:
We take the derivative of
Solving for λ gives a stationary point.
So λ is the average of the ki values. Obtaining the sign of the second derivative of L at the stationary point will determine what kind of extreme value λ is.
Evaluating the second derivative at the stationary point gives:
which is the negative of n times the reciprocal of the average of the ki. This expression is negative when the average is positive. If this is satisfied, then the stationary point maximizes the probability function.
For completeness, a family of distributions is said to be complete if and only if
For this equality to hold, it is obvious that
Confidence interval
The confidence interval for the mean of a Poisson distribution can be expressed using the relationship between the cumulative distribution functions of the Poisson and chi-squared distributions. The chi-squared distribution is itself closely related to the gamma distribution, and this leads to an alternative expression. Given an observation k from a Poisson distribution with mean μ, a confidence interval for μ with confidence level 1 – α is
or equivalently,
where
When quantiles of the Gamma distribution are not available, an accurate approximation to this exact interval has been proposed (based on the Wilson–Hilferty transformation):
where
For application of these formulae in the same context as above (given a sample of n measured values ki each drawn from a Poisson distribution with mean λ), one would set
calculate an interval for μ = nλ, and then derive the interval for λ.
Bayesian inference
In Bayesian inference, the conjugate prior for the rate parameter λ of the Poisson distribution is the gamma distribution. Let
denote that λ is distributed according to the gamma density g parameterized in terms of a shape parameter α and an inverse scale parameter β:
Then, given the same sample of n measured values ki as before, and a prior of Gamma(α, β), the posterior distribution is
The posterior mean E[λ] approaches the maximum likelihood estimate
The posterior predictive distribution for a single additional observation is a negative binomial distribution, sometimes called a Gamma–Poisson distribution.
Simultaneous estimation of multiple Poisson means
Suppose
In this case, a family of minimax estimators is given for any
Bivariate Poisson distribution
This distribution has been extended to the bivariate case. The generating function for this distribution is
with
The marginal distributions are Poisson(θ1) and Poisson(θ2) and the correlation coefficient is limited to the range
A simple way to generate a bivariate Poisson distribution
Poisson distribution using R
The R function dpois(x, lambda)
calculates the probability that there are x events in an interval, where the argument "lambda" is the average number of events per interval.
For example,
dpois(x=0, lambda=1) = 0.3678794
dpois(x=1,lambda=2.5) = 0.2052125
The following R code creates a graph of the Poisson distribution from x= 0 to 8, with lambda=2.5.
x=0:8
px = dpois(x, lambda=2.5)
plot(x, px, type="h", xlab="Number of events k", ylab="Probability of k events", ylim=c(0,0.5), pty="s", main="Poisson distribution Probability of events for lambda = 2.5")
Poisson distribution using Excel
The Excel function POISSON( x, mean, cumulative )
calculates the probability of x events where mean is lambda, the average number of events per interval. The argument cumulative specifies the cumulative distribution.
For example,
=POISSON(0, 1, FALSE)
= 0.3678794
=POISSON(1, 2.5, FALSE)
= 0.2052125
Poisson distribution using Python (SciPy)
The function scipy.stats.distributions.poisson.pmf(x, poissonLambda)
calculates the probability that there are x events in an interval, where the argument "poissonLambda" is the average number of events per interval.
Poisson distribution using Mathematica
Mathematica supports the univariate Poisson distribution as PoissonDistribution[