This is a synthetic dataset that contains groups of different density points, varied shapes, and necks between partitions.
data(compound)
A data frame containing 399 observations and two dimensions, forming six partitions:
x1: synthetically generated real positive values
x2: synthetically generated real positive values
Originally, the dataset had contained three dimensions. We intentionally removed the third dimension that corresponds to the label which the data point belongs. All description about the data set may be found in Graph-theoretical methods for detecting and describing gestalt clusters article, in the references.
C.T. Zahn, Graph-theoretical methods for detecting and describing gestalt clusters. IEEE Transactions on Computers, 1971. 100(1): p. 68-86.
P. Franti and S. Sieranoja, K-means properties on six clustering benchmark datasets, vol. 48, no. 12. pp. 4743-4759, 2018.