rerf (version 2.0.4)

defaults: Default values passed to RandMat*

Description

Given the parameter list and the categorical map, this function populates the parameter list with our "best" known parameter values for general use.

Usage

defaults(ncolX, paramList, cat.map)

Arguments

ncolX

an integer denoting the number of columns in the design matrix X.

paramList

a list (possibly empty), to be populated with a set of default values to be passed to a RandMat* function.

cat.map

a list specifying which columns in X correspond to the same one-of-K encoded feature. Each element of cat.map is a numeric vector specifying the K column indices of X corresponding to the same categorical feature after one-of-K encoding. All one-of-K encoded features in X must come after the numeric features. The K encoded columns corresponding to the same categorical feature must be placed contiguously within X. The reason for specifying cat.map is to adjust for the fact that one-of-K encoding categorical features dilutes the numeric features, since a single categorical feature is expanded to K binary features. If cat.map = NULL, then RerF assumes all features are numeric (i.e. none of the features have been one-of-K encoded).
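
The layout that cat.map expects can be sketched in base R. This is an illustration only: the toy data frame, the helper one_of_k, and the variables n_numeric, K, starts and ends are hypothetical names introduced here and are not part of rerf. The sketch places two numeric columns first, then the contiguous one-of-K columns for each categorical feature, and builds cat.map from the number of levels.

```r
# Toy data: two numeric features followed by two categorical features.
df <- data.frame(
  x1 = c(0.5, 1.2, 3.1, 2.2),
  x2 = c(10, 20, 30, 40),
  color = factor(c("red", "green", "blue", "red")),
  size = factor(c("S", "L", "S", "L"))
)

# Hypothetical helper (not from rerf): one binary column per factor level.
one_of_k <- function(f) sapply(levels(f), function(l) as.numeric(f == l))

# Numeric features first, then each categorical feature's K columns, contiguously.
X <- cbind(as.matrix(df[, c("x1", "x2")]), one_of_k(df$color), one_of_k(df$size))

# Column indices of each encoded feature, derived from the level counts.
n_numeric <- 2
K <- c(nlevels(df$color), nlevels(df$size))
ends <- n_numeric + cumsum(K)
starts <- ends - K + 1
cat.map <- Map(seq, starts, ends)  # list(3:5, 6:7)
```

Here X has 7 columns, and each element of cat.map lists the columns that belong to one original categorical feature.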

Value

If cat.map is NULL, then

  • p is set to the number of columns of X

  • d is set to the ceiling of the square root of the number of columns of X

  • sparsity: if ncol(X) >= 10, then sparsity is set to 3 / ncol(X); otherwise it is set to 1 / ncol(X).

  • prob defaults to 0.5.
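
The rules listed above for the cat.map = NULL case can be written out as a small base R sketch. The function sketch_defaults is a hypothetical stand-in, not the rerf implementation, and it rests on one assumption that this page does not confirm: values already present in paramList are kept and only missing ones are filled in.

```r
# Hypothetical helper (not part of rerf): mirrors the documented defaults
# for the cat.map = NULL case only.
sketch_defaults <- function(ncolX, paramList = list()) {
  base <- list(
    p = ncolX,                                            # number of columns of X
    d = ceiling(sqrt(ncolX)),                             # ceiling of sqrt(ncol(X))
    sparsity = if (ncolX >= 10) 3 / ncolX else 1 / ncolX, # documented threshold at 10
    prob = 0.5
  )
  # Assumption: user-supplied entries take precedence over the defaults.
  modifyList(base, paramList)
}

wide <- sketch_defaults(16)                  # p = 16, d = 4, sparsity = 3/16
narrow <- sketch_defaults(5)                 # p = 5,  d = 3, sparsity = 1/5
custom <- sketch_defaults(16, list(d = 2))   # d stays at the supplied value
```

With the package installed, the corresponding call would be defaults(ncolX, paramList, cat.map) with cat.map = NULL, as given under Usage.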