Prepare a data.table by:
transforming numeric variables into factors whenever they take fewer than thresh unique values
transforming character columns using generateFromCharacter
transforming logical columns into binary integers
dropping constant columns
Sending the data.table to setAsNumericMatrix (when finalForm == "numerical_matrix") will then allow you to get a numerical matrix usable by most machine learning algorithms.
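The preparation steps listed above can be sketched in base R. This is only an illustrative approximation, not the package's implementation: the function name shape_sketch and the sample columns are hypothetical, the factor threshold is assumed to be inclusive (<= thresh), and the character step is left out because it depends on generateFromCharacter.

```r
# Illustrative base-R sketch of the preparation steps (NOT the package's code).
# `shape_sketch` is a hypothetical name; the inclusive threshold is an assumption.
shape_sketch <- function(df, thresh = 10) {
  for (col in names(df)) {
    x <- df[[col]]
    if (is.logical(x)) {
      # logical -> binary integer (TRUE becomes 1L, FALSE becomes 0L)
      df[[col]] <- as.integer(x)
    } else if (is.numeric(x) && length(unique(x)) <= thresh) {
      # numeric column with few distinct values -> factor
      df[[col]] <- factor(x)
    }
    # character columns are left untouched in this sketch
  }
  # drop columns that hold a single distinct value
  keep <- vapply(df, function(x) length(unique(x)) > 1L, logical(1))
  df[, keep, drop = FALSE]
}

df <- data.frame(
  rating   = c(1, 2, 3, 1, 2, 3),              # 3 distinct values
  price    = c(9.5, 3.2, 7.1, 4.4, 8.8, 1.0),  # 6 distinct values
  in_stock = c(TRUE, FALSE, TRUE, TRUE, FALSE, TRUE),
  source   = rep(1, 6)                         # constant column
)

out <- shape_sketch(df, thresh = 3)
str(out)
```

Unlike the function documented here, this sketch works on a copy of a data.frame and returns a new object.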
shapeSet(dataSet, finalForm = "data.table", thresh = 10, verbose = TRUE)
dataSet: Matrix, data.frame or data.table
finalForm: "data.table" or "numerical_matrix" (defaults to "data.table")
thresh: Threshold such that a numerical column is transformed into a factor whenever its number of unique modalities is smaller than or equal to thresh (numeric, defaults to 10)
verbose: Should the algorithm talk? (logical, defaults to TRUE)
All of these changes are made by reference.
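To illustrate what modification by reference means in practice, here is a minimal sketch using data.table directly. It assumes the data.table package is installed; the helper to_int and the sample columns are hypothetical and are not part of the function documented here.

```r
library(data.table)

dt <- data.table(flag = c(TRUE, FALSE, TRUE), x = c(1.5, 2.5, 3.5))
backup <- copy(dt)  # deep copy taken before the in-place change

# Hypothetical helper: replaces a logical column with integers in place.
to_int <- function(d, col) {
  set(d, j = col, value = as.integer(d[[col]]))
  invisible(d)
}

to_int(dt, "flag")

class(dt$flag)      # the caller's object was modified without reassignment
class(backup$flag)  # the copy keeps the original type
```

If the original data must be preserved, taking a copy() before calling a function that works by reference is the usual precaution.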