Learn R Programming

modeldata (version 1.3.0)

taxi: Chicago taxi data set

Description

A data set containing information on a subset of taxi trips in the city of Chicago in 2022.

Arguments

Value

tibble

Details

The source data are originally described on the linked City of Chicago data portal. The data exported here are a pre-processed subset motivated by the modeling problem of predicting whether a rider will tip or not.

tip

Whether the rider left a tip. A factor with levels "yes" and "no".

distance

The trip distance, in odometer miles.

company

The taxi company, as a factor. Companies that occurred few times were binned as "other".

local

Whether the trip's starting and ending locations are in the same community. See the source data for community area values.

dow

The day of the week in which the trip began, as a factor.

month

The month in which the trip began, as a factor.

hour

The hour of the day in which the trip began, as a numeric.

Examples

Run this code
# \donttest{
taxi
# }

Run the code above in your browser using DataLab