Merges factor levels that occur only infrequently into combined levels with a higher frequency.
mergeSmallFactorLevels(task, cols = NULL, min.perc = 0.01,
new.level = ".merged")
(Task) The task.
(character) Which columns to convert. Default is all factor and character columns.
(numeric(1)
)
The smallest levels of a factor are merged until their combined proportion
w.r.t. the length of the factor exceeds min.perc
.
Must be between 0 and 1.
Default is 0.01.
(character(1)
)
New name of merged level.
Default is “.merged”
Task
, where merged levels are combined into a new level of name new.level
.
Other eda_and_preprocess: capLargeValues
,
createDummyFeatures
,
dropFeatures
,
normalizeFeatures
,
removeConstantFeatures
,
summarizeColumns
,
summarizeLevels