The first-difference (FD) estimator is an approach used to address the problem of omitted variables in econometrics and statistics with panel data. The estimator is obtained by running a pooled OLS estimation for a regression of Δ y i t on Δ x i t .
The FD estimator wipes out time invariant omitted variables c i using the repeated observations over time:
y i t = x i t β + c i + u i t , t = 1 , . . . T , y i t − 1 = x i t − 1 β + c i + u i t − 1 , t = 2 , . . . T . Differencing both equations, gives:
Δ y i t = y i t − y i t − 1 = Δ x i t β + Δ u i t , t = 2 , . . . T , which removes the unobserved c i .
The FD estimator β ^ F D is then simply obtained by regressing changes on changes using OLS:
β ^ F D = ( Δ X ′ Δ X ) − 1 Δ X ′ Δ y Note that the rank condition must be met for Δ X ′ Δ X to be invertible ( r a n k [ Δ X ′ Δ X ] = k ).
Similarly,
A v a ^ r ( β ^ F D ) = σ ^ u 2 ( Δ X ′ Δ X ) − 1 , where σ ^ u 2 is given by
σ ^ u 2 = [ n ( T − 1 ) − K ] − 1 u ^ ′ u ^ . Under the assumption of E [ u i t − u i t − 1 | x i t − x i t − 1 ] = 0 , the FD estimator is unbiased and consistent, i.e. E [ β ^ F D ] = β and p l i m β ^ = β . Note that this assumption is less restrictive than the assumption of weak exogeneity required for unbiasedness using the fixed effects (FE) estimator. If the disturbance term u i t follows a random walk, the usual OLS standard errors are asymptotically valid.
For T = 2 , the FD and fixed effects estimators are numerically equivalent.
Under the assumption of spherical errors, i.e. homoscedasticity and no serial correlation in u i t , the FE estimator is more efficient than the FD estimator. If u i t follows a random walk, however, the FD estimator is more efficient as Δ u i t are serially uncorrelated.
In practice, the FD estimator is easier to implement without special software, as the only transformation required is to first difference.