Consider the model below:
$$ Y_{t} = \beta_{0}+ \gamma^{'}X_{t} + \alpha P_{t}+\epsilon_{t} \hspace{0.3cm} (1) $$
$$ P_{t} = Z_{t}+\nu_{t} \hspace{2.5 cm} (2)$$
The observed data consist of \(Y_{t}\), \(X_{t}\) and \(P_{t}\), while \(Z_{t}\), \(\epsilon_{t}\), and \(\nu_{t}\)
are unobserved. The endogeneity problem arises from the correlation of \(P_{t}\) with the structural error, \(\epsilon_{t}\),
since \(E(\epsilon \nu)\neq 0\).
The requirement for the structural and measurement error is to have mean zero, but no restriction is imposed on their distribution.
Let \(\bar{S}\) be the sample mean of a variable \(S_{t}\) and \(G_{t} = G(X_{t})\) for any given function \(G\) that
has finite third own and cross moments. Lewbel(1997) proves that the following instruments can be constructed and used with 2SLS to obtain consistent estimates:
$$ q_{1t}=(G_{t} - \bar{G}) \hspace{1.6 cm}(3a)$$
$$ q_{2t}=(G_{t} - \bar{G})(P_{t}-\bar{P}) \hspace{0.3cm} (3b) $$
$$ q_{3t}=(G_{t} - \bar{G})(Y_{t}-\bar{Y}) \hspace{0.3cm} (3c)$$
$$ q_{4t}=(Y_{t} - \bar{Y})(P_{t}-\bar{P}) \hspace{0.3cm} (3d)$$
$$ q_{5t}=(P_{t}-\bar{P})^{2} \hspace{1.5 cm} (3e) $$
$$ q_{6t}=(Y_{t} - \bar{Y})^{2}\hspace{1.5 cm} (3f)$$
Instruments in equations 3e
and 3f
can be used only when the measurement and the structural errors are symmetrically distributed.
Otherwise, the use of the instruments does not require any distributional assumptions for the errors. Given that the regressors \(G(X) = X\)
are included as instruments, \(G(X)\) should not be linear in \(X\) in equation 3a
.
Let small letter denote deviation from the sample mean: \(s_{i} = S_{i}-\bar{S}\). Then, using as instruments the variables presented in
equations 3
together with 1
and \(X_{t}\), the two-stage-least-squares estimation will provide consistent estimates for the parameters
in equation 1
under the assumptions exposed in Lewbel(1997).