- diagrams
a list of persistence diagrams which are either the output of a persistent homology calculation like ripsDiag/calculate_homology
/PyH
, or diagram_to_df
.
- cv
a positive number at most the length of `diagrams` which determines the number of cross validation splits to be performed (default 1, aka no cross-validation). If `prob.model` is TRUE then cv is set to 1 since kernlab performs 3-fold CV internally in this case. When performing classification, classes are balanced within each cv fold.
- dim
a non-negative integer vector of homological dimensions in which the model is to be fit.
- t
either a vector of positive numbers representing the grid of values for the scale of the persistence Fisher kernel or NULL, default 1. If NULL then t is selected automatically, see details.
- sigma
a vector of positive numbers representing the grid of values for the bandwidth of the Fisher information metric, default 1.
- rho
an optional positive number representing the heuristic for Fisher information metric approximation, see diagram_distance
. Default NULL. If supplied, distance matrix calculations are sequential.
- y
a response vector with one label for each persistence diagram. Must be either numeric or factor, but doesn't need to be supplied when `type` is "one-svc".
- type
a string representing the type of task to be performed. Can be any one of "C-svc","nu-svc","one-svc","eps-svr","nu-svr" - default for regression is "eps-svr" and for classification is "C-svc". See ksvm
for details.
- distance_matrices
an optional list of precomputed Fisher distance matrices, corresponding to the rows in `expand.grid(dim = dim,sigma = sigma)`, default NULL.
- C
a number representing the cost of constraints violation (default 1) this is the 'C'-constant of the regularization term in the Lagrange formulation.
- nu
numeric parameter needed for nu-svc, one-svc and nu-svr. The `nu` parameter sets the upper bound on the training error and the lower bound on the fraction of data points to become Support Vector (default 0.2).
- epsilon
epsilon in the insensitive-loss function used for eps-svr, nu-svr and eps-bsvm (default 0.1).
- prob.model
if set to TRUE builds a model for calculating class probabilities or in case of regression, calculates the scaling parameter of the Laplacian distribution fitted on the residuals. Fitting is done on output data created by performing a 3-fold cross-validation on the training data. For details see references (default FALSE).
- class.weights
a named vector of weights for the different classes, used for asymmetric class sizes. Not all factor levels have to be supplied (default weight: 1). All components have to be named.
- fit
indicates whether the fitted values should be computed and included in the model or not (default TRUE).
- cache
cache memory in MB (default 40).
- tol
tolerance of termination criteria (default 0.001).
- shrinking
option whether to use the shrinking-heuristics (default TRUE).
- num_workers
the number of cores used for parallel computation, default is one less the number of cores on the machine.