Learn R Programming

nomclust (version 2.1.6)

good3: Goodall 3 (G3) Measure

Description

A function for calculation of a proximity (dissimilarity) matrix based on the G3 similarity measure.

Usage

good3(data)

Arguments

data

A data.frame or a matrix with cases in rows and variables in colums.

Value

The function returns a dissimilarity matrix of the size n x n, where n is the number of objects in the original dataset in the argument data.

Details

The Goodall 3 similarity measure was presented in (Boriah et al., 2008). It is a simple modification of the original Goodall measure (Goodall, 1966). The measure assigns higher weight if the infrequent categories match regardless on frequencies of other categories.

References

Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation. In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.

Goodall V.D. (1966). A new similarity index based on probability. Biometrics, 22(4), p. 882.

See Also

eskin, good1, good2, good4, iof, lin, lin1, morlini, of, sm, ve, vm.

Examples

Run this code
# NOT RUN {
# sample data
data(data20)

# dissimilarity matrix calculation
prox.good3 <- good3(data20)

# }

Run the code above in your browser using DataLab