Dehejia and Wahba (1999) sample of data from Lalonde (1986). This data set includes 185 treated units from the National Supported Work (NSW) program, paired with 2490 control units drawn from the Panel Study of Income Dynamics (PSID-1).
The treatment variable of interest is nsw
, which indicates that an individual
was in the job training program. The main outcome of interest is
real earnings in 1978 (re78
). The remaining variables are characteristics
of the individuals, to be used as controls.
lalonde
A data frame with 2675 rows and 14 columns.
treatment indicator: participation in the National Supported Work program.
real earnings in 1978 (outcome)
unemployed in 1978; actually an indicator for zero income in 1978
age in years
indicator for identifying as black
indicator for identifying as Hispanic
factor for self-identified race/ethnicity; same information as black
and hisp
in character form.
indicator for being married
real income in 1974
real income in 1975
unemployment in 1974; actually an indicator for zero income in 1974
unemployment in 1975; actually an indicator for zero income in 1975
Years of education of the individual
indicator for no high school degree; actually an indicator for years of education less than 12
Dehejia, Rajeev H., and Sadek Wahba. "Causal effects in non-experimental studies: Reevaluating the evaluation of training programs." Journal of the American statistical Association 94.448 (1999): 1053-1062.
LaLonde, Robert J. "Evaluating the econometric evaluations of training programs with experimental data." The American economic review (1986): 604-620.