smoothMean

Calculates target encodings using a smoothing parameter and the number of observations in each category.
This approach is more robust to target leakage and helps avoid overfitting.
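
As a rough illustration of the idea, the sketch below blends each category's mean with the global mean (the prior), using a weight that grows with the category count. It is written in base R and is not taken from the superml source: the function name `smooth_mean_sketch`, the sigmoid weighting, and the sample data are assumptions made for this example.

```r
# Illustrative sketch only; smooth_mean_sketch is a hypothetical helper,
# not the superml implementation.
smooth_mean_sketch <- function(x, y, min_samples_leaf = 1, smoothing = 1) {
  prior <- mean(y)                          # global target mean
  groups <- split(y, x)                     # target values per category
  cat_mean <- vapply(groups, mean, numeric(1))
  cat_n <- lengths(groups)                  # observations per category
  # Assumed sigmoid weight: moves toward 1 as the count exceeds min_samples_leaf
  w <- 1 / (1 + exp(-(cat_n - min_samples_leaf) / smoothing))
  prior * (1 - w) + cat_mean * w            # named vector: category -> encoding
}

# Made-up sample data
region <- c("del", "csk", "rcb", "del", "csk", "pune", "guj", "del")
win <- c(0, 1, 1, 0, 0, 0, 0, 1)

enc <- smooth_mean_sketch(region, win)
train_encoded <- unname(enc[region])

# Categories unseen in training fall back to the prior
test_region <- c("rcb", "kol")
test_encoded <- unname(enc[test_region])
test_encoded[is.na(test_encoded)] <- mean(win)
```

Rare categories are pulled toward the prior, which is what limits overfitting on categories with only a handful of rows.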

The idea is to provide a standard interface
to users who use both R and Python for building machine learning models.
This package provides a scikit-learn style fit/predict interface for
training machine learning models in R.

Manish Saraswat

superml

Build Machine Learning Models Like Using Python's Scikit-Learn
Library in R

smoothMean function

Arguments

smoothMean Calculator — smoothMean

<dl>

<dt>train_df</dt>
<dd>train dataset</dd>


<dt>test_df</dt>
<dd>test dataset</dd>


<dt>colname</dt>
<dd>name of categorical column</dd>


<dt>target</dt>
<dd>name of target column</dd>


<dt>min_samples_leaf</dt>
<dd>minimum samples to take category average into account</dd>


<dt>smoothing</dt>
<dd>smoothing effect to balance categorical average vs prior</dd>


<dt>noise_level</dt>
<dd>random noise to add, optional</dd>

</dl>
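
A call using the arguments listed above might look like the following sketch. The data frames are invented for illustration, and the values passed for `min_samples_leaf`, `smoothing`, and `noise_level` are arbitrary choices rather than documented defaults. The structure of the returned object is not described in this section, so the example only inspects it.

```r
# Made-up sample data
train <- data.frame(
  region = c("del", "csk", "rcb", "del", "csk", "pune", "guj", "del"),
  win = c(0, 1, 1, 0, 0, 0, 0, 1)
)
test <- data.frame(
  region = c("rcb", "csk", "rcb", "del", "guj", "pune", "csk", "kol")
)

# Run only when superml is installed
if (requireNamespace("superml", quietly = TRUE)) {
  encodings <- superml::smoothMean(
    train_df = train,
    test_df = test,
    colname = "region",
    target = "win",
    min_samples_leaf = 1,  # arbitrary value for illustration
    smoothing = 1,         # arbitrary value for illustration
    noise_level = 0        # no random noise
  )
  # Inspect the result rather than assuming its exact structure
  str(encodings)
}
```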
