Contains 8000 2-d points, with 6 "natural" looking shapes, all of which have an sinusoid-like shape that intersects
with each cluster.
Usage
data("DS3")
Arguments
Format
A data frame with 8000 observations on the following 2 variables.
X
a numeric vector
Y
a numeric vector
Details
Originally used as a benchmark data set for the Chameleon clustering algorithm[1] to illustrate the a data set
containing arbitrarily shaped spatial data surrounded by both noise and artifacts.
References
Karypis, George, Eui-Hong Han, and Vipin Kumar (1999). "Chameleon: Hierarchical clustering using dynamic modeling." Computer 32(8): 68-75.