Learn R Programming

inaparc (version 1.2.0)

ssamp: Initialization of cluster prototypes using systematic random sampling

Description

Initializes the cluster prototypes matrix using the systemically sampled data objects.

Usage

ssamp(x, k)

Arguments

x

a numeric vector, data frame or matrix.

k

an integer for the number of clusters.

Value

an object of class ‘inaparc’, which is a list consists of the following items:

v

a numeric matrix containing the initial cluster prototypes.

ctype

a string representing the type of centroid, which used to build prototype matrix. Its value is ‘obj’ with this function because the cluster prototype matrix contains the sampled objects.

call

a string containing the matched function call that generates this ‘inaparc’ object.

Details

The function ssamp generates a prototype matrix using the sytematic random sampling technique. Since the data objects are enough away from each other with this technique it may provide better initializations than the simple random sampling. The first object is randomly sampled from the top n/k objects of data set and assigned as the prototype of first cluster. The prototypes of remaining clusters are the objects whose row indexes are \(v_{1} + i\;(n/k)\), where v1 and i are the index of first selected object and index of cluster, respectively.

See Also

aldaoud, ballhall, crsamp, firstk, forgy, hartiganwong, inofrep, inscsf, insdev, kkz, kmpp, ksegments, ksteps, lastk, lhsmaximin, lhsrandom, maximin, mscseek, rsamp, rsegment, scseek, scseek2, spaeth, topbottom, uniquek, ursamp

Examples

Run this code
# NOT RUN {
data(iris)
res <- ssamp(x=iris[,1:4], k=5)
v <- res$v
print(v)
# }

Run the code above in your browser using DataLab