The Atkinson–Stiglitz theorem is a theorem of public economics which states "that, where the utility function is separable between labor and all commodities, no indirect taxes need be employed" if non-linear income taxation can be used by the government and was developed in a seminal article by Joseph Stiglitz and Anthony Atkinson in 1976. The Atkinson–Stiglitz theorem is generally considered to be one of the most important theoretical results in public economics and spawned a broad literature which delimited the conditions under which the theorem holds, e.g. Saez (2002) which showed that the Atkinson–Stiglitz theorem does not hold if households have heterogeneous rather than homogeneous preferences. In practice the Atkinson–Stiglitz theorem has often been invoked in the debate on optimal capital income taxation: Because capital income taxation can be interpreted as the taxation of future consumption in excess of the taxation of present consumption, the theorem implies that governments should abstain from capital income taxation if non-linear income taxation is an option since capital income taxation would not improve equity by comparison to the non-linear income tax, while additionally distorting savings.
For an individual whose wage is                     w                , its budget constraint is given by
                              ∑                      j                                    q                      j                                    x                      j                          =                  ∑                      j                          (                  x                      j                          +                  t                      j                          (                  x                      j                          )        )        =        w        L        −        T        (        w        L        )                ,                where                               q                      i                                   and                               x                      i                                   are the price and the purchase of the i-th commodity, respectively.
To maximise the utility function, the first order condition is:
                              U                      j                          =                                            (              1              +                              t                                  j                                ′                            )              (              −                              U                                  L                                            )                                      w              (              1              −                              T                ′                            )                                              (        j        =        1        ,        2        ,        .        .        .        ,        N        )        .                The government maximises the social welfare function, and so
                              ∫                      0                                ∞                                    [          w          L          −                      ∑                          j                                            x                          j                                −                                    R              ¯                                ]                d        F        =        0                .                Then we use a density function                     f                 to express the Hamiltonian:
                    H        =                  [          G          (          U          )          −          λ                      {            w            L            −                          ∑                              j                                                    x                              j                                      −                                          R                ¯                                      }                    ]                f        −        μ        θ                  U                      L                                  .                Taking its variation with regard to                               x                      j                                  , we use the condition for its maximum.
                    −        λ                  [                                    (                                                                    ∂                                          x                                              1                                                                                                  ∂                                          x                                              j                                                                                                        )                                      U                                +          1          ]                −                                            μ              θ                        f                                    [                                                                      ∂                                      2                                                  U                                            ∂                                  x                                      1                                                  ∂                L                                                                        (                                                                    ∂                                          x                                              1                                                                                                  ∂                                          x                                              j                                                                                                        )                                      U                                +                                                                      ∂                                      2                                                  U                                            ∂                                  x                                      j                                                  ∂                L                                              ]                =        0                .                Then the following relation holds:
                                          (                                                            ∂                                      x                                          1                                                                                        ∂                                      x                                          j                                                                                            )                                U                          =        −                                            U                              j                                                    U                              1                                                    =        −                                            1              +                              t                                  j                                ′                                                    1              +                              t                                  1                                ′                                                            .                Substituting this relation into the above condition yields:
                    λ                  [                                                    1                +                                  t                                      j                                    ′                                                            1                +                                  t                                      1                                    ′                                                              −          1          ]                =                                            μ              θ                              U                                  j                                                      f                                    [                                                                      ∂                                      2                                                  U                                            ∂                L                ∂                                  x                                      j                                                                                ⋅                                    1                              U                                  j                                                              −                                                                      ∂                                      2                                                  U                                            ∂                L                ∂                                  x                                      1                                                                                ⋅                                    1                              U                                  1                                                              ]                =                                            μ              θ                              U                                  j                                                      f                                                ∂                          ∂              L                                                (          ln                                              U                              j                                              −          ln                                              U                              1                                              )                        ,                and we obtain
                    λ                  [                                                    1                +                                  t                                      j                                    ′                                                            1                +                                  t                                      1                                    ′                                                              −          1          ]                =                                            μ              θ                              U                                  j                                                      f                                                ∂                          ∂              L                                                (          ln                                                              U                                  j                                                            U                                  1                                                              )                        .                Note that there is no loss of generality in setting                               t                      1                    ′                         zero, therefore we put                               t                      1                    ′                =        0                . Since                               U                      j                          =        (        1        +                  t                      j                    ′                )        α                , we have
                                                        t                              j                            ′                                      1              +                              t                                  j                                ′                                                    =                                            μ              θ              α                                      λ              f                                                            ∂                          ∂              L                                                (          ln                                                              U                                  j                                                            U                                  1                                                              )                        .                Thus it turns out that no indirect taxation need to be employed, i.e.                               t                      j                          =        0                , provided that the utility function is weakly separable between labour and all consumption goods.
Joseph Stiglitz explains why indirect taxation is unnecessary, viewing the Atkinson-Stiglitz threorem from a different perspective.
Suppose that those who are in category 2 are the more able. Then, for Pareto efficient taxation at which a government aims, we impose two conditions. The first condition is that the utility of category 1 is equal to or more than a given level:
                                                        U              ¯                                            1                          ≤                  V                      1                          (                  C                      1                          ,                  Y                      1                          )                .                The second condition is that the government revenue                     R                , which is equal to or more than the revenue requirement                                           R            ¯                                  , is increased by a given amount:
                    R        =        −        (                  C                      1                          −                  Y                      1                          )                  N                      1                          −        (                  C                      2                          −                  Y                      2                          )                  N                      2                                  ,                                                          R            ¯                          ≤        R                ,                where                               N                      1                                   and                               N                      2                                   indicate the number of individuals of each type. Under these conditions, the government needs to maximise the utility                               V                      2                          (                  C                      2                          ,                  Y                      2                          )                 of category 2. Then writing down the Lagrange function for this problem:
                                          L                          =                  V                      2                          (                  C                      2                          ,                  Y                      2                          )        +        μ                  V                      1                          (                  C                      1                          ,                  Y                      1                          )        +                  λ                      2                          (                  V                      2                          (                  C                      2                          ,                  Y                      2                          )        −                  V                      2                          (                  C                      1                          ,                  Y                      1                          )        )        +                  λ                      1                          (                  V                      1                          (                  C                      1                          ,                  Y                      1                          )        −                  V                      1                          (                  C                      2                          ,                  Y                      2                          )        )        +        γ                  (          −          (                      C                          1                                −                      Y                          1                                )                      N                          1                                −          (                      C                          2                                −                      Y                          2                                )                      N                          2                                −                                    R              ¯                                )                        ,                which ensures satisfaction of the self-selection constraints, we obtain the first order conditions:
                    μ                                            ∂                              V                                  1                                                                    ∂                              C                                  1                                                                    −                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              C                                  1                                                                    +                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              C                                  1                                                                    −        γ                  N                      1                          =        0                ,                                    μ                                            ∂                              V                                  1                                                                    ∂                              Y                                  1                                                                    −                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              Y                                  1                                                                    +                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              Y                                  1                                                                    +        γ                  N                      1                          =        0                ,                                                                        ∂                              V                                  2                                                                    ∂                              C                                  2                                                                    +                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              C                                  2                                                                    −                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              C                                  2                                                                    −        γ                  N                      2                          =        0                ,                                                                        ∂                              V                                  2                                                                    ∂                              Y                                  2                                                                    +                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              Y                                  2                                                                    −                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              Y                                  2                                                                    +        γ                  N                      2                          =        0                .                For the case where                               λ                      1                          =        0                 and                               λ                      2                          =        0                , we have
                                                        ∂                              V                                  i                                                            /                            ∂                              Y                                  i                                                                    ∂                              V                                  i                                                            /                            ∂                              C                                  i                                                                    +        1        =        0                ,                for                     i        =        1        ,        2                , and therefore the government can achieve a lump-sum taxation. For the case where                               λ                      1                          =        0                 and                               λ                      2                          >        0                , we have
                                                        ∂                              V                                  2                                                            /                            ∂                              Y                                  2                                                                    ∂                              V                                  2                                                            /                            ∂                              C                                  2                                                                    +        1        =        0                ,                and we find that the marginal tax rate for category 2 is zero. And as to category 1, we have
                                                        ∂                              V                                  1                                                            /                            ∂                              Y                                  2                                                                    ∂                              V                                  1                                                            /                            ∂                              C                                  1                                                                    =        −                                            1              −                              λ                                  2                                            (              ∂                              V                                  2                                                            /                            ∂                              Y                                  1                                            )                              /                                            N                                  1                                            γ                                      1              +                              λ                                  2                                            (              ∂                              V                                  2                                                            /                            ∂                              C                                  1                                            )                              /                                            N                                  1                                            γ                                              .                If we put                               δ                      i                          =                                            ∂                              V                                  i                                                            /                            ∂                              Y                                  1                                                                    ∂                              V                                  i                                                            /                            ∂                              C                                  1                                                                            ,                (        i        =        1        ,        2        )                , then the marginal tax rate for category 1 is                               δ                      1                          +        1                .
Also, we have the following expression:
                              δ                      1                          =        −                  (                                                    1                −                ν                                  δ                                      2                                                                              1                +                ν                                              )                        ,                where we denote                     ν                 by
                    ν        =                                                            λ                                  2                                            (              ∂                              V                                  2                                                            /                            ∂                              C                                  1                                            )                                                      N                                  1                                            γ                                              .                Therefore, by assumption,                               δ                      1                          <                  δ                      2                                  , and so we can directly prove that                     −        1        <                  δ                      1                          <                  δ                      2                                  . Accordingly, we find that the marginal tax rate for category 1 is positive.
For the case where                               λ                      1                          >        0                 and                               λ                      2                          =        0                , the marginal tax rate for category 2 is negative. The lump-sum tax imposed on an individual of category 1 would become larger than that for category 2, if the lump-sum tax were feasible.
Now we need to consider a case where income level and several commodities are observals. Each individual's consumption function is expressed in a vector form as
                                                        C                                            1                          =                  ∑                      j                                    C                      1            j                                                              e                                            j                                                                                          C                                            2                          =                  ∑                      j                                    C                      2            j                                                              e                                            j                                  .                In this case, the government's budget constraint is
                    R        ≤                  ∑                      k            =            1                                2                          (                  Y                      k                                    N                      k                          )        −                  N                      1                                    ∑                      j                                    C                      1            j                          −                  N                      2                                    ∑                      j                                    C                      2            j                                  .                Then we have
                    μ                                            ∂                              V                                  1                                                                    ∂                              C                                  1                  j                                                                    −                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              C                                  1                  j                                                                    +                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              C                                  1                  j                                                                    −        γ                  N                      1                          =        0                ,                                    μ                                            ∂                              V                                  1                                                                    ∂                              Y                                  1                                                                    −                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              Y                                  1                                                                    +                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              Y                                  1                                                                    +        γ                  N                      1                          =        0                ,                                                                        ∂                              V                                  2                                                                    ∂                              C                                  2                  j                                                                    +                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              C                                  2                  j                                                                    −                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              C                                  2                  j                                                                    −        γ                  N                      2                          =        0                ,                                                                        ∂                              V                                  2                                                                    ∂                              Y                                  2                                                                    +                  λ                      2                                                              ∂                              V                                  2                                                                    ∂                              Y                                  2                                                                    −                  λ                      1                                                              ∂                              V                                  1                                                                    ∂                              Y                                  2                                                                    +        γ                  N                      2                          =        0                .                Here we restricting ourselves to the case where                               λ                      1                          =        0                 and                               λ                      2                          >        0                . It follows that
                                                                        ∂                                  V                                      2                                                                              ∂                                  C                                      2                    j                                                                                                      ∂                                  V                                      2                                                                              ∂                                  C                                      2                    n                                                                                      =        1                ,                                                                    ∂                                  V                                      2                                                                              ∂                                  C                                      2                    j                                                                                                      ∂                                  V                                      2                                                                              ∂                                  Y                                      2                                                                                      =        1                .                Suppose all individuals have the same indifference curve in C-L plane. The separability between leisure and consumption enables us to have                                                                         ∂                                  2                                                            U                                  k                                                                    ∂                              C                                  k                  j                                            ∂                              L                                  k                                                                    =        0                ,                 which yields
                                                        ∂                              V                                  1                                                                    ∂                              C                                  1                  j                                                                    =                                            ∂                              V                                  2                                                                    ∂                              C                                  1                  j                                                                            .                As a result, we obtain
                                                                        ∂                                  V                                      1                                                                              ∂                                  C                                      1                    j                                                                                                      ∂                                  V                                      1                                                                              ∂                                  C                                      1                    n                                                                                      =        1                .                Thus we find that it is unnecessary to impose taxes on commodities.
Conditions for randomization
We need to consider a case where high ability individuals (who usually earn more money to show their ability) pretend to be like they are not more able. In this case, it could be argued that the government needs to randomize the taxes imposed on the low ability individuals, for the purpose of increasing the effectiveness of screening. It is possible that under certain conditions we can do the randomization of the taxes without damaging the low ability individuals, and therefore we discuss the conditions. For the case where an individual chooses to show his ability, we see a tax schedule be related to                     {                  C                      2                                ∗                          ,                  Y                      2                                ∗                          }                . For the case where an individual chooses to hide his ability, we see one of two tax schedules:                     {                  C                      1                                ∗                          ,                  Y                      1                                ∗                          }                 and                     {                  C                      1                                ∗            ∗                          ,                  Y                      1                                ∗            ∗                          }                . The randomization is done so that the risk of the former case should differ from that of the latter.
To avoid hitting the low ability group, the mean consumption must be shifted upwards at each                     Y                . As the compsumption is maximized, a higher                                                         C              ¯                                            1                                   is set for a higher                                                         Y              ¯                                            1                                  . Then the relations between those variables are
                              C                      1                                ∗                          =                                            C              ¯                                            1                          +        h                ,                          Y                      1                                ∗                          =                                            Y              ¯                                            1                          +        λ        h                                              C                      1                                ∗            ∗                          =                                            C              ¯                                            1                          −        h                ,                          Y                      1                                ∗            ∗                          =                                            Y              ¯                                            1                          −        λ        h                .                The utility function is                               V                      2                          (                  C                      1                                ∗                          ,                  Y                      1                                ∗                          )                 and                               V                      2                          (                  C                      1                                ∗            ∗                          ,                  Y                      1                                ∗            ∗                          )                , and we have the condition for the optimum:
                              V                      2                          C                              ∗                                                    (        d                                            C              ¯                                            1                          +        d        h        )        +                  V                      2                          Y                              ∗                                                    (        d                                            Y              ¯                                            1                          +        λ        d        h        )        +                  V                      2                          C                              ∗                ∗                                                    (        d                                            C              ¯                                            1                          −        d        h        )        +                  V                      2                          Y                              ∗                ∗                                                    (        d                                            Y              ¯                                            1                          −        λ        d        h        )        =        0                ,                and likewise
                              V                      1                          C                              ∗                                                    (        d                                            C              ¯                                            1                          +        d        h        )        +                  V                      1                          Y                              ∗                                                    (        d                                            Y              ¯                                            1                          +        λ        d        h        )        +                  V                      1                          C                              ∗                ∗                                                    (        d                                            C              ¯                                            1                          −        d        h        )        +                  V                      1                          Y                              ∗                ∗                                                    (        d                                            Y              ¯                                            1                          −        λ        d        h        )        =        0                .                And accordingly we have
                                          [                                                            S                                      V                                          2                      C                                                                                        S                                      V                                          2                      Y                                                                                                                    S                                      V                                          1                      C                                                                                        S                                      V                                          1                      Y                                                                                            ]                                                [                                                            d                                                            C                      ¯                                                                                                                    d                                                            Y                      ¯                                                                                            ]                          =        −                              [                                                            D                                      V                                          2                      C                                                        +                  λ                  D                                      V                                          2                      Y                                                                                                                    D                                      V                                          1                      C                                                        +                  λ                  D                                      V                                          1                      C                                                                                            ]                          d        h                ,                where                     S                  V                      k            C                          =                  V                      k                          C                              ∗                                                    +                  V                      k                          C                              ∗                ∗                                                             and                     S                  V                      k            Y                          =                  V                      k                          Y                              ∗                                                    +                  V                      k                          Y                              ∗                ∗                                                             and                     k        =        1        ,        2                . Similarly                     D                  V                      k            C                          =                  V                      k                          C                              ∗                                                    −                  V                      k                          C                              ∗                ∗                                                             and                     D                  V                      k            Y                          =                  V                      k                          Y                              ∗                                                    −                  V                      k                          Y                              ∗                ∗                                                            .
Then we have
                              lim                      h            →            0                                                              d              (                                                Y                  ¯                                            −                                                C                  ¯                                            )                                      d              h                                      =                                                            F                                  1                                            −                              F                                  2                                                                    (              −              2              )              (              M              R                              S                                  1                                            −              M              R                              S                                  2                                            )                                              ,                where                     M        R                  S                      k                          =        −        (                                            ∂                              V                                  k                                                                    ∂                              C                                  1                                                                              )                      −            1                                                              ∂                              V                                  k                                                                    ∂                              Y                                  1                                                                            . As to                               F                      1                          ,                  F                      2                                   we denote them by                               F                      1                          =        (                                            ∂                              V                                  2                                                                    ∂                              C                                  1                                                                              )                      −            1                                    M                      2                          (        1        −        M        R                  S                      1                          )                 and                               F                      2                          =        (                                            ∂                              V                                  1                                                                    ∂                              C                                  1                                                                              )                      −            1                                    M                      1                          (        1        −        M        R                  S                      2                          )                . Also we define                               M                      k                                   by                               M                      k                          =        D                  V                      k            C                          +        λ        D                  V                      k            Y                                  . But the first derivative of                                           Y            ¯                          −                              C            ¯                                   with regard to                     h                , at                     h        =        0                , is zero (because                               M                      k                          =        0                ), and so we need to calculate its second derivative.
                                                                        d                                  2                                            (                                                Y                  ¯                                            −                                                C                  ¯                                            )                                      d                              h                                  2                                                                    =                  H                      1                          +                  H                      2                                  ,                where                               H                      1                          =                                            d              (                              F                                  1                                            −                              F                                  2                                            )                                      d              h                                                            1                          −              2              (              M              R                              S                                  1                                            −              M              R                              S                                  2                                            )                                               and                               H                      2                          =        (        −        1        )                                            d              (                                                Y                  ¯                                            −                                                C                  ¯                                            )                                      d              h                                                                          d              ln                                            (                −                2                )                (                M                R                                  S                                      1                                                  −                M                R                                  S                                      2                                                  )                                                    d              h                                              . And so                               H                      2                                   vanishes at                     h        =        0                . Then we have
                                                                        d                                  2                                            (                                                Y                  ¯                                            −                                                C                  ¯                                            )                                      d                              h                                  2                                                                    =                                                            I                                  1                                            +                              I                                  2                                                                    (              −              1              )              (              M              R                              S                                  1                                            −              M              R                              S                                  2                                            )                                                      .                                              I                      1                          =        (                  V                      2            C            C                          +        2        λ                  V                      2            C            Y                          +                  λ                      2                                    V                      2            Y            Y                          )        (                                            ∂                              V                                  2                                                                    ∂                              C                                  1                                                                              )                      −            1                          (        1        −        M        R                  S                      1                          )                                              I                      2                          =        (        −        1        )        (                  V                      1            C            C                          +        2        λ                  V                      1            C            Y                          +                  λ                      2                                    V                      1            Y            Y                          )        (                                            ∂                              V                                  1                                                                    ∂                              C                                  1                                                                              )                      −            1                          (        1        −        M        R                  S                      2                          )                Since                     M        R                  S                      2                          <        M        R                  S                      1                          <        1                , we obtain the condition under which randomization is desirable:
                    (                  V                      2            C            C                          +        2        λ                  V                      2            C            Y                          +                  λ                      2                                    V                      2            Y            Y                          )        (                  V                      1                          C                              1                                                    +                  V                      2                          Y                              1                                                    )        −        (                  V                      1            C            C                          +        2        λ                  V                      1            C            Y                          +                  λ                      2                                    V                      2            Y            Y                          )        (                  V                      2                          C                              1                                                    +                  V                      2                          Y                              1                                                    )        <        0                .