Merges factor levels that occur only infrequently into combined levels with a higher frequency.
mergeSmallFactorLevels(
task,
cols = NULL,
min.perc = 0.01,
new.level = ".merged"
)
Task
, where merged levels are combined into a new level of name new.level
.
(Task)
The task.
(character) Which columns to convert. Default is all factor and character columns.
(numeric(1)
)
The smallest levels of a factor are merged until their combined proportion
w.r.t. the length of the factor exceeds min.perc
.
Must be between 0 and 1.
Default is 0.01.
(character(1)
)
New name of merged level.
Default is “.merged”
Other eda_and_preprocess:
capLargeValues()
,
createDummyFeatures()
,
dropFeatures()
,
normalizeFeatures()
,
removeConstantFeatures()
,
summarizeColumns()
,
summarizeLevels()