cluster.datasets (version 1.0-1)

linguistic.relatedness: Hartigan (1975) Relatedness Values of Selected Words

Description

Frequencies with which a pair of words is judged more highly related than other pairs, taken over many triads and subjects. This is Table 10.4 on page 184 of Chapter 10 in Hartigan (1975).

Usage

data(linguistic.relatedness)

Format

A data frame with 6 observations on the following 7 variables.
word
a character vector for the word
the
a numeric vector for the frequency with which words are related to 'the'
boy
a numeric vector for the frequency with which words are related to 'boy'
has
a numeric vector for the frequency with which words are related to 'has'
lost
a numeric vector for the frequency with which words are related to 'lost'
a
a numeric vector for the frequency with which words are related to 'a'
dollar
a numeric vector for the frequency with which words are related to 'dollar'

Source

Levelt, W. J. M. (1967). Psychological representations of syntactic structures, in The Structure and Psychology of Language, T. G. Bever and W. Weksel, eds., Holt, Rinehart and Winston, New York.

SPAETH2 Cluster Analysis Datasets http://people.sc.fsu.edu/~jburkardt/datasets/spaeth2/spaeth2.html

Details

This is an unusual data set to be used with the triads-leader algorithm.

References

Hartigan, J. A. (1975). Clustering Algorithms, John Wiley, New York.

Examples

data(linguistic.relatedness)
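
Clustering code usually expects a word-by-word numeric matrix rather than a data frame with a label column. The following is a minimal sketch of that conversion, based on the layout described under Format. The helper relatedness_matrix() is hypothetical and not part of cluster.datasets, and the symmetry check is an assumption to verify, not a documented property of the table.

```r
# Hypothetical helper (not part of cluster.datasets): convert a data frame
# with a label column plus one numeric column per word into a numeric matrix
# whose row names are the labels.
relatedness_matrix <- function(df, label_col = "word") {
  stopifnot(is.data.frame(df), label_col %in% names(df))
  value_cols <- setdiff(names(df), label_col)
  m <- as.matrix(df[, value_cols, drop = FALSE])
  rownames(m) <- as.character(df[[label_col]])
  m
}

# Only touch the real data set if the package is installed.
if (requireNamespace("cluster.datasets", quietly = TRUE)) {
  data(linguistic.relatedness, package = "cluster.datasets")

  # Inspect the columns listed under Format.
  str(linguistic.relatedness)

  rel <- relatedness_matrix(linguistic.relatedness)
  print(rel)

  # Assumption: the table may be symmetric; check it instead of relying on it.
  print(isSymmetric(unname(rel)))
}
```

Keeping the conversion in a small function makes it easy to reuse on other label-plus-columns tables, while the requireNamespace() guard lets the script run without error when the package is missing.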