Convert numeric entries which large/infinite (absolute) values in a data.frame or task. Only numeric/integer columns are affected.
capLargeValues(
obj,
target = character(0L),
cols = NULL,
threshold = Inf,
impute = threshold,
what = "abs"
)
(data.frame | Task)
Input data.
(character)
Name of the column(s) specifying the response.
Target columns will not be capped.
Default is character(0)
.
(character)
Which columns to convert.
Default is all numeric columns.
(numeric(1)
)
Threshold for capping.
Every entry whose absolute value is equal or larger is converted.
Default is Inf
.
(numeric(1)
)
Replacement value for large entries.
Large negative entries are converted to -impute
.
Default is threshold
.
(character(1)
)
What kind of entries are affected?
“abs” means abs(x) > threshold
,
“pos” means abs(x) > threshold && x > 0
,
“neg” means abs(x) > threshold && x < 0
.
Default is “abs”.
Other eda_and_preprocess:
createDummyFeatures()
,
dropFeatures()
,
mergeSmallFactorLevels()
,
normalizeFeatures()
,
removeConstantFeatures()
,
summarizeColumns()
,
summarizeLevels()
capLargeValues(iris, threshold = 5, impute = 5)
Run the code above in your browser using DataLab