simple_bin

train

test

variables to exclude (e.g. the target, or the row ID)

exclude_vars

if you only want certain variables binned, you may specify them
directly instead of excluding all other variables

include_vars

single number specifying the number of bins to create on each variable,
or a named list specifying cut-points for each variable

bins

if bins is given as a number, then this determines whether to create
bins with equal number of observations ("height") or of equal width
("width")

type

logical. Give missing values their own bin?

na_include


Function to apply simple equal-width or equal-height binning to columns of a
training dataset, and then optionally bin the columns of a test set into bins
with the appropriate cutpoints


Useful functions for performing common
data analysis tasks that I could not find in other packages. For
example, flexible functions for discretizing ("binning") continuous
variables are included, since this is a common technique used in
industry that is not well appreciated in academia. See the vignette
for a more detailed high-level explanation, or the documentation
for full details.

simple_bin: Discretize variables in your training and test datasets

Description

Usage

Arguments

Value

Details

See Also