miss_var_summary: Summarise the missingness in each variable
Description
Provide a summary for each variable of the number, percent missings, and
cumulative sum of missings of the order of the variables. By default,
it orders by the most missings in each variable.
Usage
miss_var_summary(data, order = FALSE, add_cumsum = FALSE, digits, ...)
Value
a tibble of the percent of missing data in each variable
Arguments
data
a data.frame
order
a logical indicating whether to order the result by n_miss.
Defaults to TRUE. If FALSE, order of variables is the order input.
add_cumsum
logical indicating whether or not to add the cumulative
sum of missings to the data. This can be useful when exploring patterns
of nonresponse. These are calculated as the cumulative sum of the missings
in the variables as they are first presented to the function.
digits
how many digits to display in pct_miss column. Useful when
you are working with small amounts of missing data.
miss_var_summary(airquality)
miss_var_summary(oceanbuoys, order = TRUE)
if (FALSE) {
# works with group_by from dplyrlibrary(dplyr)
airquality %>%
group_by(Month) %>%
miss_var_summary()
}