A dataset containing information about some randomly generated people,
created using the excellent wakefield
package. It is created as
deliberately messy dataset.
typical_data
A data frame with 5000 rows and 11 variables:
Unique identifier for each individual, a sequential character vector of zero-padded identification numbers (IDs). see ?wakefield::id
Race for each individual, "Black", "White", "Hispanic", "Asian", "Other", "Bi-Racial", "Native", and "Hawaiin", see ?wakefield::race
Age of each individual, see ?wakefield::age
Male or female, see ?wakefield::sex
Height in centimeters, see ?wakefield::height
vector of intelligence quotients (IQ), see ?wakefield::iq
whether or not this person smokes, see ?wakefield::smokes
Yearly income in dollars, see ?wakefield::income
Whether or not this person has died yet., see ?wakefield::died