Girish Mahajan (Editor)

Pareto interpolation

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

Pareto interpolation is a method of estimating the median and other properties of a population that follows a Pareto distribution. It is used in economics when analysing the distribution of incomes in a population, when one must base estimates on a relatively small random sample taken from the population.

The family of Pareto distributions is parameterized by

  • a positive number κ that is the smallest value that a random variable with a Pareto distribution can take. As applied to distribution of incomes, κ is the lowest income of any person in the population; and
  • a positive number θ the "Pareto index"; as this increases, the tail of the distribution gets thinner. As applied to distribution of incomes, this means that the larger the value of the Pareto index θ the smaller the proportion of incomes many times as big as the smallest incomes.
  • Pareto interpolation can be used when the available information includes the proportion of the sample that falls below each of two specified numbers a < b. For example, it may be observed that 45% of individuals in the sample have incomes below a = $35,000 per year, and 55% have incomes below b = $40,000 per year.

    Let

    Pa = proportion of the sample that lies below a;Pb = proportion of the sample that lies below b.

    Then the estimates of κ and θ are

    κ ^ = ( P b P a ( 1 / a θ ^ ) ( 1 / b θ ^ ) ) 1 / θ ^

    and

    θ ^ = log ( 1 P a ) log ( 1 P b ) log ( b ) log ( a ) .

    The estimate of the median would then be

    estimated median = κ ^ 2 1 / θ ^ ,

    since the actual population median is

    median = κ 2 1 / θ .

    References

    Pareto interpolation Wikipedia


    Similar Topics