- data
matrix or data.frame. If both clara_samples and clara_sample_size equal 0, then the data parameter can be also a dissimilarity matrix, where the main diagonal equals 0.0 and the number of rows equals the number of columns
- max_clusters
either a numeric value, a contiguous or non-continguous numeric vector specifying the cluster search space
- distance_metric
a string specifying the distance method. One of, euclidean, manhattan, chebyshev, canberra, braycurtis, pearson_correlation, simple_matching_coefficient, minkowski, hamming, jaccard_coefficient, Rao_coefficient, mahalanobis, cosine
- criterion
one of 'dissimilarity' or 'silhouette'
- clara_samples
number of samples to draw from the data set in case of clustering large applications (clara)
- clara_sample_size
fraction of data to draw in each sample iteration in case of clustering large applications (clara). It should be a float number greater than 0.0 and less or equal to 1.0
- minkowski_p
a numeric value specifying the minkowski parameter in case that distance_metric = "minkowski"
- swap_phase
either TRUE or FALSE. If TRUE then both phases ('build' and 'swap') will take place. The 'swap_phase' is considered more computationally intensive.
- threads
an integer specifying the number of cores to run in parallel. Openmp will be utilized to parallelize the number of sample draws
- verbose
either TRUE or FALSE, indicating whether progress is printed during clustering
- plot_clusters
TRUE or FALSE, indicating whether the iterative results should be plotted. See the details section for more information
- seed
integer value for random number generator (RNG)