Compare classifications via the normalized
variation of information criterion.
Usage
compareClass(a, b)
Arguments
a
A numeric or character vector of class labels.
b
A numeric or character vector of class labels.
Must have the same length as
a.
Value
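The variation of information criterion (Meila 2002) for a and
b, divided by the log of the length of the vectors so that it
falls in [0,1]. A value of 0 means that a and b define the same
partition, regardless of how the classes are labeled.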
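As a sketch of what this value represents, the normalized criterion can be written out as follows. The notation is an assumption introduced here for illustration and does not come from the function's source: n is the common length of a and b, p_{ij} is the fraction of positions where a has class i and b has class j, and p_{i\cdot} and p_{\cdot j} are the corresponding marginal fractions. Terms with p_{ij} = 0 are taken to contribute 0.

\[
\begin{aligned}
H(a) &= -\sum_{i} p_{i\cdot} \log p_{i\cdot}, \qquad
H(b) = -\sum_{j} p_{\cdot j} \log p_{\cdot j}, \\
I(a,b) &= \sum_{i,j} p_{ij} \log \frac{p_{ij}}{p_{i\cdot}\, p_{\cdot j}}, \\
\mathrm{VI}(a,b) &= H(a) + H(b) - 2\, I(a,b), \\
\mathrm{VI}_{\mathrm{norm}}(a,b) &= \frac{\mathrm{VI}(a,b)}{\log n}.
\end{aligned}
\]

For instance, if a and b each split nine observations into the same three groups of three under different labels, then H(a) = H(b) = I(a,b) = \log 3, so the variation of information and its normalized value are both 0. Because the normalization is a ratio of logarithms, the base of the logarithm does not affect the result as long as the same base is used throughout.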
References
Marina Meila (2002).
Comparing clusterings.
Technical Report no. 418, Department of Statistics, University
of Washington.
See http://www.stat.washington.edu/www/research/reports.
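Examples
# Same partition under different labels: the criterion should be 0
a <- rep(1:3, 3)
a
b <- rep(c("A", "B", "C"), 3)
b
compareClass(a, b)
# Randomly drawn labels: the result varies from run to run
a <- sample(1:3, 9, replace = TRUE)
a
b <- sample(c("A", "B", "C"), 9, replace = TRUE)
b
compareClass(a, b)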