Learn R Programming

mlbench (version 2.1-6)

BostonHousing: Boston Housing Data

Description

Housing data for 506 census tracts of Boston from the 1970 census. The dataframe BostonHousing contains the original data by Harrison and Rubinfeld (1979), the dataframe BostonHousing2 the corrected version with additional spatial information (see references below).

Usage

data("BostonHousing", package = "mlbench")
data("BostonHousing2", package = "mlbench")

Arguments

Format

The original data are 506 observations on 14 variables, medv being the target variable:

crimper capita crime rate by town
znproportion of residential land zoned for lots over 25,000 sq.ft
indusproportion of non-retail business acres per town
chasCharles River dummy variable (= 1 if tract bounds river; 0 otherwise)
noxnitric oxides concentration (parts per 10 million)
rmaverage number of rooms per dwelling
ageproportion of owner-occupied units built prior to 1940
disweighted distances to five Boston employment centres
radindex of accessibility to radial highways
taxfull-value property-tax rate per USD 10,000
ptratiopupil-teacher ratio by town
b\(1000(B - 0.63)^2\) where \(B\) is the proportion of blacks by town
lstatpercentage of lower status of the population
medvmedian value of owner-occupied homes in USD 1000's

The corrected data set has the following additional columns:

cmedvcorrected median value of owner-occupied homes in USD 1000's
townname of town
tractcensus tract
lonlongitude of census tract
latlatitude of census tract

References

Harrison, D. and Rubinfeld, D.L. (1978). Hedonic prices and the demand for clean air. Journal of Environmental Economics and Management, 5, 81--102.

Gilley, O.W., and R. Kelley Pace (1996). On the Harrison and Rubinfeld Data. Journal of Environmental Economics and Management, 31, 403--405. [Provided corrections and examined censoring.]

Blake, C.L. & Merz, C.J. (1998). UCI Repository of Machine Learning Databases. Irvine, CA: University of California, Irvine, Department of Information and Computer Science. Formerly available from http://www.ics.uci.edu/~mlearn/MLRepository.html.

Pace, R. Kelley, and O.W. Gilley (1997). Using the Spatial Configuration of the Data to Improve Estimation. Journal of the Real Estate Finance and Economics, 14, 333--340. [Added georeferencing and spatial estimation.]

Examples

Run this code
data("BostonHousing", package = "mlbench")
summary(BostonHousing)

data("BostonHousing2", package = "mlbench")
summary(BostonHousing2)

Run the code above in your browser using DataLab