Binary response model with continuous endogenous explanatory variables

Given a probit model y=1[y* > 0] where y* = x₁ β + zδ + u, and u ~ N(0,1), without losing generality, z can be represented as z = x₁ θ₁ + x₂ θ₂ + v. When u is correlated with v, there will be an issue of endogeneity. This can be caused by omitted variables and measurement errors. There are also many cases where z is partially determined by y and endogeneity issue arises. For instance, in a model evaluating the effect of different patient features on their choice of whether going to hospital, y is the choice and z is the amount of the medicine a respondent took, then it is very intuitive that more often the respondent goes to hospital, it is more likely that she took more medicine, hence endogeneity issue arises. When there are endogenous explanatory variables, the estimator generated by usual estimation procedure will be inconsistent, then the corresponding estimated Average Partial Effect (APE) will be inconsistent, too.

To address this issue, there are usually two different estimation procedure to generate consistent estimators. Under the normality assumption v~N(0,σ²), u = ρv + ε must hold, where ρ = cov(u , v)/σ² and ε~N(0,1-ρ² σ²). Then the equation for y^* can be rewritten as y^* = x₁ β + zδ + ρv + ε.

This model can be consistently estimated by 2-Stage Least Square (2SLS):

1) Regress z on (x₁, x₂) and obtain the consistent estimator θ ^ and the residual v ^ ;

2) Estimate the binary response model on (x₁, z, v ^ ) and get the consistent estimator for the scaled coefficients (β_ρσ, δ_ρσ, ρ_ρσ) ≡ (β, δ, ρ)/√(1 - ρ² σ² );

Then (y = 1│x, z) = Φ ( x₁ β ^ _ρσ + z δ ^ _ρσ + ρ ^ _ρσ v ^ ) . Since the APE of variable w ~ at ( x ~ , z ~ ) is given by

E_v [ ∂ ϕ ( x 1 β ρ σ + z δ ρ σ + ρ ρ σ v ) ∂ w ~ | ( x ~ , z ~ )]

By Law of Large Number, a consistent estimator is given as

1 N ∑ i = 1 n ∂ ϕ ( x 1 β ^ ρ σ + z δ ^ ρ σ + ρ ^ ρ σ v ^ i ) ∂ w ~ | ( x ~ , z ~ )

This model can also be consistently estimated by conditional Maximum Likelihood Method. Because P(y, z│x) = P (y│z, x) P (z| x) where P (y│x, z) is given by

[ ϕ ( x 1 ( β − ρ θ 1 ) − x 2 ρ θ 2 + z ( δ + ρ ) ( 1 − ρ 2 σ 2 ) ] y [ 1 − ϕ ( x 1 ( β − ρ θ 1 ) − x 2 ρ θ 2 + z ( δ + ρ ) ( 1 − ρ 2 σ 2 ) ] 1 − y

and P(z│x) is given by ϕ ( z − x 1 θ 1 − x 2 θ 2 σ )

Then the log-likelihood function for maximization is given by:

∑ i = 1 n y i log ⁡ [ ϕ ( x 1 i ( β − ρ θ 1 ) − x 2 i ρ θ 2 + z i ( δ + ρ ) ( 1 − ρ 2 σ 2 ) ]

+ ( 1 − y i ) log ⁡ [ 1 − ϕ ( x 1 i ( β − ρ θ 1 ) − x 2 i ρ θ 2 + z i ( δ + ρ ) ( 1 − ρ 2 σ 2 ) ]

+ log ⁡ [ ϕ ( z − x 1 θ 1 − x 2 θ 2 σ ) ]

Once the consistent estimators are obtained, APE can be calculated following the same procedure given above. All the discussion above is mainly about the probit model. When the distribution assumption is changed, the same logic still applies.

References

Binary response model with continuous endogenous explanatory variables Wikipedia

(Text) CC BY-SA