Find the unique values, and the counts of those values, for a set of variables in a data.frame. Useful in PMI analysis and for exploring document metadata.
get_unique_values_and_counts(metadata, variable_names = NA, threshold = 0)
A data.frame containing document covariates.
A string or vector of strings giving the column names of the covariates to get unique values for.
The number of times a unique value of a variable must appear in order to be included in the returned list object. Defaults to 0. Allows the user to ignore very infrequent values.
A list of vectors of unique variable values that occur more than "threshold" times.
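The behavior described above can be approximated in base R. The following is only a minimal sketch under stated assumptions, not the package's implementation: the helper name sketch_unique_values_and_counts and the example metadata are hypothetical, and the exact structure returned by get_unique_values_and_counts may differ. It assumes a value is kept when its count is strictly greater than threshold.

```r
# Hypothetical base R stand-in; NOT the package's own code.
# Assumption: a value is kept when it occurs more than `threshold` times.
sketch_unique_values_and_counts <- function(metadata, variable_names, threshold = 0) {
  result <- lapply(variable_names, function(v) {
    counts <- table(metadata[[v]])        # count each distinct value in column v
    counts[counts > threshold]            # drop values that are too infrequent
  })
  names(result) <- variable_names         # one list entry per requested variable
  result
}

# Made-up document covariates, for illustration only.
metadata <- data.frame(
  party   = c("D", "R", "D", "I", "D", "R"),
  chamber = c("House", "Senate", "House", "House", "Senate", "House"),
  stringsAsFactors = FALSE
)

res <- sketch_unique_values_and_counts(metadata, c("party", "chamber"), threshold = 1)
names(res$party)    # unique values of `party` seen more than once
res$chamber         # counts for the retained values of `chamber`
```

With threshold = 1, the value "I" is dropped from party because it appears only once, while both chamber values are retained.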