Select important variables from a set of features based on RANGER- or XGBLIN-estimated variable importance
varSelect(x, y, method = c("RANGER", "XGBLIN"), xgb.params = list(alpha
= 0.1, lambda = 0.1), p = 0.2, print.plot = TRUE, verbose = TRUE)
Matrix / Data Frame of Predictors
Outcome vector
String: "RANGER", "XGBLIN": Learner to use for estimating variable importace. Default = "RANGER"
List of parameters for method = "XGBLIN"
Float (0, 1): Fraction of variables in x to select. p * ncol(x)
. May help to set to a fraction twice
what you expect to be the true fraction of useful variables, to reduce false negatives at the expense of false
positives which can be dealt by an appropriate learning algorithm. (Default = .2)
Logical: If TRUE, print index plot of variable importance using mplot3.x
Logical: If TRUE, print messages to screen