The raw data behind the story "A Statistical Analysis of the Work of Bob Ross" https://fivethirtyeight.com/features/a-statistical-analysis-of-the-work-of-bob-ross/. An analysis using this data was contributed by Jonathan Bouchet as a package vignette at https://fivethirtyeight-r.netlify.com/articles/bob_ross.html.
bob_ross
A data frame with 403 rows representing episodes and 71 variables:
Episode code
Season number
Episode number
Title of episode
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
Present (1) or not (0)
# NOT RUN {
# To convert data frame to tidy data (long) format, run:
library(dplyr)
library(tidyr)
library(stringr)
bob_ross_tidy <- bob_ross %>%
gather(object, present, -c(episode, season, episode_num, title)) %>%
mutate(present = as.logical(present)) %>%
arrange(episode, object)
# }
Run the code above in your browser using DataLab