Learn R Programming

scorecardModelUtils (version 0.0.1.0)

Credit Scorecard Modelling Utils

Description

Provides infrastructure functionalities such as missing value treatment, information value calculation, GINI calculation etc. which are used for developing a traditional credit scorecard as well as a machine learning based model. The functionalities defined are standard steps for any credit underwriting scorecard development, extensively used in financial domain.

Copy Link

Version

Install

install.packages('scorecardModelUtils')

Monthly Downloads

151

Version

0.0.1.0

License

GPL-2 | GPL-3

Maintainer

Arya Poddar

Last Published

April 14th, 2019

Functions in scorecardModelUtils (0.0.1.0)

dtree_trend_iv

Recursive Decision Tree partitioning with monotonic event rate along with IV table for individual numerical variable
fn_error

Computes error measures between observed and predicted values
fn_mode

Calculating mode value of a vector
scoring

Scoring a dataset with class based on a scalling logic to arrive at final score
scalling

Converting coefficients of logistic regression into scores for scorecard building
univariate

Univariate analysis of variables
support_vector_parameters

Hyperparameter optimisation or parameter tuning for Suppert Vector Machine by grid search
missing_val

Missing value imputation
iv_table

WOE and IV table for list of numerical and categorical variables
num_to_cat

Binning numerical variables based on cuts from IV table
fn_target

Redefines target value
gradient_boosting_parameters

Hyperparameter optimisation or parameter tuning for Gradient Boosting Regression Modelling by grid search
others_class

Clubbing of classes of categorical variable with low population percentage into one class
vif_filter

Removing multicollinearity from a model using vif test
gini_table

Performance measure table with Gini coefficient, KS-statistics and Gini lift curve
random_forest_parameters

Hyperparameter optimisation or parameter tuning for Random Forest by grid search
sampling

Random sampling of data into train and test
iv_filter

Variable reduction based on Information Value filter
club_cat_class

Clubbing class of a categorical variable with low population percentage with another class of similar event rate
cv_filter

Variable reduction based on Cramer's V filter
cat_new_class

Clubbing class of categorical variables with low population percentage with another class of similar event rate
categorical_iv

IV table for individual categorical variable
fn_conf_mat

Creates confusion matrix and its related measures
fn_cross_index

Creates random index for k-fold cross validation
cv_table

Pairwise Cramer's V among a list of categorical variables
cv_test

Cramer's V value between two categorical variables
dtree_split_val

Getting the split value for terminal nodes from decision tree