Method that evaluates clustering algorithm from a file directory or dataframe.
execute_package_parallel(
directory_files,
df,
algorithms_execute,
measures_execute,
cluster_min,
cluster_max,
metrics_execute,
attributes,
number_algorithms,
numberClusters,
numberDataSets,
is_metric_external,
is_metric_internal,
name_dataframe
)
It's a string with the route where the datasets are located.
Data matrix or data frame, or dissimilarity matrix, depending on the value of the argument.
Character vector with the algorithms to be executed. The algorithms implemented are: hclust, apclusterK,agnes,clara,daisy,diana,fanny,mona,pam,gmm,kmeans_arma, kmeans_rcpp,mini_kmeans, pvclust.
Character array with the measurements of dissimilarity to be executed. Depending on the algorithm, one or the other is implemented. Among them we highlight: Euclidena, Manhattan, etc.
Minimum number of clusters.
Maximum number of clusters. cluster_max must be greater or equal cluster_min.
Character array defining the metrics to be executed. The night metrics implemented are: Entropy, Variation_information, Precision, Recall, F_measure, Fowlkes_mallows_index, Connectivity, Dunn and Silhouette.
It's a numeric field with the number of algorithms.
It's a numeric field with the difference between clusters.
It's a numeric field with the number of datasets.
Boolean field to indicate whether to run external metrics.
Boolean field to indicate whether to run internal metrics.
Name of data.frame when is fill.
Returns a list with the result matrix of evaluating the data from the indicated algorithms, metrics and number of clusters.