remove_rare_categorical

dataSet

List of column(s) name(s) of dataSet to transform. To transform all 
columns, set it to "auto". (character, default to "auto")

cols

share of occurencies under which row should be removed (numeric, default to 0.01)

threshold

Should the algorithm talk? (logical, default to TRUE)

verbose

Filter rows that have a rare occurences

Do most of the painful data preparation for a data science project with a minimum amount of code; Take advantages of data.table efficiency and use some algorithmic trick in order to perform data preparation in a time and RAM efficient way.

remove_rare_categorical: Filter rare categoricals

Description

Usage

Arguments

Value

Details

Examples