- data.na
a dataframe with only numeric variables
- res.imputedata
an output from imputedata
- listvar
a character vector indicating for which subset of incomplete variables variable selection must be performed. By default all column names.
- nb.clust
the number of clusters used for imputation
- nnodes
number of CPU cores for parallel computing. By default, nnodes = 1
- sizeblock
an integer indicating the number of variables sampled at each iteration
- method.select
a single string indicating the variable selection method applied on each subset of variables
- B
number of iterations, by default B = 200
- r
a numerical vector (or a single real number) indicating the threshold used for each variable in listvar. Each value of r should be between 0 and 1. See details.
- graph
a boolean. If TRUE two graphics are plotted per variable in listvar
: a graphic reporting the variable importance measure of each explanatory variable and a graphic reporting the influence of the number iterations (B) on the importance measures
- printflag
a boolean. If TRUE, a message is printed at each iteration. Use printflag = FALSE for silent selection.
- path.outfile
a vector of strings indicating the path for redirection of print messages. Default value is NULL, meaning that silent imputation is performed. Otherwise, print messages are saved in the files path.outfile/output.txt. One file per node is generated.
- mar
a numerical vector of the form c(bottom, left, top, right). Only used if graph = TRUE
- cex.names
expansion factor for axis names (bar labels) (only used if graph = TRUE)
- modelNames
a vector of character strings indicating the models to be fitted in the EM phase of clustering