This function will take the frequency-ranked set of variables and will generate a new model with terms that meet the net residual improvement (NeRI) threshold criteria.
updateModel.Res(Outcome,
covariates = "1",
pvalue = c(0.025, 0.05),
VarFrequencyTable,
variableList,
data,
type = c("LM", "LOGIT", "COX"),
testType=c("Binomial", "Wilcox", "tStudent"),
lastTopVariable = 0,
timeOutcome = "Time",
maxTrainModelSize = -1,
p.thresholds = NULL
)
An object of class lm, glm, or coxph containing the final model
A vector with the names of the features that were included in the final model
An object of class formula with the formula used to fit the final model
A vector in which each element represents the z-score of the NeRI, associated to the testType, for each feature found in the final model
The name of the column in data that stores the variable to be predicted by the model
A string of the type "1 + var1 + var2" that defines which variables will always be included in the models (as covariates)
The maximum p-value, associated to the NeRI, allowed for a term in the model
An array with the ranked frequencies of the features, (e.g. the ranked.var value returned by the ForwardSelection.Model.Res function)
A data frame with two columns. The first one must have the names of the candidate variables and the other one the description of such variables
A data frame where all variables are stored in different columns
Fit type: Logistic ("LOGIT"), linear ("LM"), or Cox proportional hazards ("COX")
Type of non-parametric test to be evaluated by the improvedResiduals function: Binomial test ("Binomial"), Wilcoxon rank-sum test ("Wilcox"), Student's t-test ("tStudent"), or F-test ("Ftest")
The maximum number of variables to be tested
The name of the column in data that stores the time to event (needed only for a Cox proportional hazards regression model fitting)
Maximum number of terms that can be included in the model
The p.value thresholds estimated in forward selection
Jose G. Tamez-Pena and Antonio Martinez-Torteya
updateModel.Bin