Learn R Programming

imputeR: A General Multivariate Imputation Framework

imputeR is an R package that provides a general framework for missing values imputation based on automated variable selection.

The main function impute inputs a matrix containing missing values and returns a complete data matrix using the variable selection functions provided as part of the package, or written by the user.

The package also offers many useful tools for imputation research based on impute. For example, the Detect function can be used to detect the variables' type in a given data matrix. guess can be used for naive imputation such as mean imputation, median imputation, majority imputation (for categorical variables only) and random imputation. SimIm function stands for "simulation for imputation". It accepts a complete matrix and randomly introduce some percentage of missing values into the matrix so imputation methods can be employed subsequently to impute this artificial missing data matrix. Because the true values are actually know so imputation accuracy can be easily calculated. This calls for the SimEval function that extends SimIm function, simulates a number of missing data matrices, applies a imputation method to these missing matrices and evaluate its performance. This enables the uncertainty of the imputation method to be obtained.

Reference

You can cite imputeR the following:

Feng L, Moritz S, Nowak G, Welsh AH, O'Neill TJ (2018). _imputeR: A

General Multivariate Imputation Framework_. R package version 2.1, <URL: https://CRAN.R-project.org/package=imputeR>.

Version

2.1

License

GPL-3

Copy Link

Version

Install

install.packages('imputeR')

Monthly Downloads

288

Version

2.2

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Last Published

January 20th, 2020

Functions in imputeR (2.2)

guess

Impute by (educated) guessing
plotIm

Plot function for imputation
stepBothC

Best subset for classification (both direction)
pcrR

Principle component regression for imputation
stepBackR

Best subset (backward direction) for regression
Detect

Detect variable type in a data matrix
CubistR

Cubist method for imputation
orderbox

Ordered boxplot for a data matrix
ridgeR

Ridge shrinkage for regression
parkinson

Parkinsons Data Set
mixGuess

Naive imputation for mixed type data
ridgeC

Ridge regression with lasso for imputation
plsR

Partial Least Square regression for imputation
rpartC

classification tree for imputation
stepBackC

Best subset for classification (backward)
mixError

Calculate mixed error when the imputed matrix is mixed type
major

Majority imputation for a vector
spect

SPECT Heart Data Set
tic

Insurance Company Benchmark (COIL 2000) Data Set
stepForR

Best subset (forward direction) for regression
stepBothR

Best subset for regression (both direction)
mr

calculate miss-classification error
stepForC

Best subset for classification (forward direction)
Rmse

calculate the RMSE or NRMSE
impute

General Imputation Framework in R
imputeR-package

imputeR-package description
SimIm

Introduce some missing values into a data matrix
SimEval

Evaluate imputation performance by simulation
glmboostR

Boosting for regression
lassoC

logistic regression with lasso for imputation
lassoR

LASSO for regression
gbmC

boosting tree for imputation