Method
Consider the model:
Yt=β0 + β1Xt+αPt+εt
Pt=Zt+νt
The observed data consist of Yt, Xt and Pt,
while Zt, εt,
and νt are unobserved. The endogeneity problem arises from
the correlation of Pt with the structural error
εt,
since E(εν)≠0.
The requirement for the structural and measurement error is to have mean zero, but no restriction is imposed on their distribution.
Let S̅ be the sample mean of a variable St
and Gt=G(Xt) for any given function \(G\) that
has finite third own and cross moments. Lewbel(1997) proves that the following instruments can be constructed and used with two-stage least squares to obtain consistent estimates:
q1t=(Gt-G̅)(3a)
q2t=(Gt-G̅)(Pt-P̅)(3b)
q3t=(Gt-G̅)(Yt-Y̅)(3c)
q4t=(Yt-Y̅)(Pt-P̅)(3d)
q5t=(Pt-P̅)2(3e)
q6t=(Yt-Y̅)2(3f)
Instruments in equations \(3e\) and \(3f\) can be used only when the measurement and the structural errors are symmetrically distributed.
Otherwise, the use of the instruments does not require any distributional assumptions for the errors. Given that the regressors \(G(X) = X\)
are included as instruments, \(G(X)\) should not be linear in \(X\) in equation \(3a\).
Let small letter denote deviation from the sample mean: si = Si-S̅.
Then, using as instruments the variables presented in equations \(3\) together with \(1\) and
Xt, the two-stage-least-squares estimation will provide consistent estimates for the parameters
in equation \(1\) under the assumptions exposed in Lewbel(1997).
Formula parameter
The formula
argument follows a four part notation:
A two-sided formula describing the model (e.g. y ~ X1 + X2 + P
), a single endogenous regressor
(e.g. P
), and the exogenous variables from which the internal instrumental variables should
be build (e.g. IIV(iiv=y2)
), each part separated by a single vertical bar (|
).
The instrumental variables that should be built are specified as (multiple) functions, one for each
instrument. This function is IIV
and uses the following arguments:
iiv
Which internal instrument to build. One of g, gp, gy, yp, p2, y2
can be chosen.
g
Which function g
represents in iiv
.
One of x2, x3, lnx, 1/x
can be chosen.
Only required if the type of internal instrument demands it.
...
The exogenous regressors to build the internal instrument. If more than one is given,
separate instruments are built for each. Only required if the type of internal instrument demands it.
Note that no argument to IIV
is to be supplied as character but as symbols without quotation marks.
Optionally, additional external instrumental variables to also include in the instrumental variable
regression can be specified. These external instruments have to be already present in the data
and are provided as the fourth right-hand side part of the formula, again separated by a vertical bar.
See the example section for illustrations on how to specify the formula
parameter.