Generates a synthetic categorical variable using ordered polytomous regression (without or with bootstrap).
syn.polr(y, x, xp, proper = FALSE, maxit = 1000, trace = FALSE,
MaxNWts = 10000, ...)
A list with two components:
a vector of length k
with synthetic values of y
.
a summary of the model fitted to the observed data and used to produce synthetic values.
an original data vector of length n
.
a matrix (n
x p
) of original covariates.
a matrix (k
x p
) of synthesised covariates.
for proper synthesis (proper = TRUE
)
a model is fitted to a bootstrapped sample of the original data.
the maximum number of iterations for nnet
.
switch for tracing optimization for nnet
.
the maximum allowable number of weights for nnet
.
Generates synthetic ordered categorical variables by the proportional odds logistic regression (polr) model. The function repeatedly applies logistic regression on the successive splits. The model is also known as the cumulative link model.
The algorithm of syn.polr
uses the
function polr
from the MASS package.
In order to avoid bias due to perfect prediction, the data are augmented by the method of White, Daniel and Royston (2010).
In case the call to polr
fails,
usually because the data are very sparse,
multinom
function is used instead.
White, I.R., Daniel, R. and Royston, P. (2010). Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables. Computational Statistics and Data Analysis, 54, 2267--2275.