The Tobit model is a statistical model proposed by James Tobin (1958) to describe the relationship between a non-negative dependent variable
Contents
- Etymology
- Consistency
- Interpretation
- Variations of the Tobit model
- Type I
- Type II
- Type III
- Type IV
- Type V
- The likelihood function
- Non Parametric Version
- Applications
- References
The model supposes that there is a latent (i.e. unobservable) variable
where
Etymology
When asked why it was called the "Tobit" model, instead of Tobin, James Tobin explained that this term was introduced by Arthur Goldberger, either as a contraction of "Tobin probit", or as a reference to the novel The Caine Mutiny, a novel by Tobin's friend Herman Wouk, in which Tobin makes a cameo as "Mr Tobit". Tobin reports having actually asked Goldberger which it was, and the man refused to say.
Consistency
If the relationship parameter
Interpretation
The
Variations of the Tobit model
Variations of the Tobit model can be produced by changing where and when censoring occurs. Amemiya (1985, p. 384) classifies these variations into five categories (Tobit type I - Tobit type V), where Tobit type I stands for the first model described above. Schnedler (2005) provides a general formula to obtain consistent likelihood estimators for these and other variations of the Tobit model.
Type I
The Tobit model is a special case of a censored regression model, because the latent variable
Another example is censoring of values above
Yet another model results when
The rest of the models will be presented as being bounded from below at 0, though this can be generalized as done for Type I.
Type II
Type II Tobit models introduce a second latent variable.
Heckman (1987) falls into the Type II Tobit. In Type I Tobit, the latent variable absorb both the process of participation and 'outcome' of interest. Type II Tobit allows the process of participation/selection and the process of 'outcome' to be independent, conditional on x.
Type III
Type III introduces a second observed dependent variable.
The Heckman model falls into this type.
Type IV
Type IV introduces a third observed dependent variable and a third latent variable.
Type V
Similar to Type II, in Type V only the sign of
The likelihood function
Below are the likelihood and log likelihood functions for a type I Tobit. This is a Tobit that is censored from below at
Next, let
and the log likelihood is given by
Note that this is different from the likelihood function of the truncated regression model.
Non-Parametric Version
If the underlying latent variable
Applications
Tobit models have, for example, been applied to estimate factors that impact grant receipt, including financial transfers distributed to sub-national governments who may apply for these grants. In these cases, grant recipients cannot receive negative amounts, and the data is this left-censored. For instance, Dahlberg and Johansson (2002) analyse a sample of 115 municipalities (42 of which received a grant). Dubois and Fattore (2011) use a Tobit model to investigate the role of various factors in European Union fund receipt by applying Polish sub-national governments. The data may however be left-censored at a point higher than zero, with the risk of mis-specification. Both studies apply Probit and other models to check for robustness.