calculate_variable_split

calculate_variable_split.default

This function calculate candidate splits for each selected variable.
For numerical variables splits are calculated as percentiles
(in general uniform quantiles of the length grid_points).
For all other variables splits are calculated as unique values.

Collection of tools for assessment of feature importance and feature effects.
Key functions are:
feature_importance() for assessment of global level feature importance,
ceteris_paribus() for calculation of the what-if plots,
partial_dependence() for partial dependence plots,
conditional_dependence() for conditional dependence plots,
accumulated_dependence() for accumulated local effects plots,
aggregate_profiles() and cluster_profiles() for aggregation of ceteris paribus profiles,
generic print() and plot() for better usability of selected explainers,
generic plotD3() for interactive, D3 based explanations, and
generic describe() for explanations in natural language.
The package 'ingredients' is a part of the 'DrWhy.AI' universe (Biecek 2018) <arXiv:1806.08915>.

Przemyslaw Biecek

ingredients

Effects and Importances of Model Ingredients

Hubert Baniecki

Adam Izdebski

calculate_variable_split function

<dl><dt>data</dt>
<dd>validation dataset. Is used to determine distribution of observations.</dd>
<dt>variables</dt>
<dd>names of variables for which splits shall be calculated</dd>
<dt>grid_points</dt>
<dd>number of points used for response path</dd>
<dt>variable_splits_type</dt>
<dd>how variable grids shall be calculated? Use "quantiles" (default) for percentiles or "uniform" to get uniform grid of points</dd>
<dt>new_observation</dt>
<dd>if specified (not <code>NA</code>) then all values in <code>new_observation</code> will be included in <code>variable_splits</code></dd></dl>

Arguments

Internal Function for Split Points for Selected Variables — calculate_variable_split

<dl>

<dt>data</dt>
<dd>validation dataset. Is used to determine distribution of observations.</dd>


<dt>variables</dt>
<dd>names of variables for which splits shall be calculated</dd>


<dt>grid_points</dt>
<dd>number of points used for response path</dd>


<dt>variable_splits_type</dt>
<dd>how variable grids shall be calculated? Use "quantiles" (default) for percentiles or "uniform" to get uniform grid of points</dd>


<dt>new_observation</dt>
<dd>if specified (not <code>NA</code>) then all values in <code>new_observation</code> will be included in <code>variable_splits</code></dd>

</dl>

calculate_variable_split: Internal Function for Split Points for Selected Variables

Description

Usage

Value

Arguments

Details