A function for calculation of a proximity (dissimilarity) matrix based on the VM similarity measure.
Usage
vm(data)
Arguments
data
A data.frame or a matrix with cases in rows and variables in colums.
Value
The function returns a dissimilarity matrix of the size n x n, where n is the number of objects in the original dataset in the argument data.
Details
The Variable Mutability similarity measure was introduced in (Sulc and Rezankova, 2019).
It treats the similarity between two categories based on the within-cluster variability expressed by the normalized mutability. The measure assigns higher weights to rarer categories.
References
Sulc Z. and Rezankova H. (2019). Comparison of Similarity Measures for Categorical Data in Hierarchical Clustering. Journal of Classification. 2019, 35(1), p. 58-72. DOI: 10.1007/s00357-019-09317-5.