smoothMean

train_df

test_df

colname

target

minimum samples to take category average into account

min_samples_leaf

smoothing effect to balance categorical average vs prior

smoothing

noise_level

Calculates target encodings using a smoothing parameter and count of categorical variables.
This approach is more robust to possibility of leakage and avoid overfitting.

The idea is to provide a standard interface
to users who use both R and Python for building machine learning models.
This package provides a scikit-learn's fit, predict interface to
train machine learning models in R.

smoothMean: smoothMean Calculator

Description

Usage

Arguments

Value

Examples