Learn R Programming

languageR (version 1.5.0)

havelaar: The determiner 'het' in the Dutch novel Max Havelaar

Description

The frequency of the determiner 'het' in the Dutch novel 'Max Havelaar' by Multatuli (Eduard Douwes Dekker), in 99 consecutive text fragments of 1000 tokens each.

Usage

data(havelaar)

Arguments

Format

A data frame with 99 observations on the following 2 variables.

Chunk

a numeric vector with the indices of the text fragments.

Frequency

a numeric vector with the frequencies of the determiner 'het' in the text fragments.

Examples

Run this code
# NOT RUN {
data(havelaar)

n = 1000                          # token size of text fragments
p = mean(havelaar$Frequency / n)  # relative frequencies

plot(qbinom(ppoints(99), n, p), sort(havelaar$Frequency),
   xlab = paste("quantiles of (", n, ",", round(p, 4), 
   ")-binomial", sep=""), ylab = "frequencies")


lambda = mean(havelaar$Frequency)
ks.test(havelaar$Frequency, "ppois", lambda)
ks.test(jitter(havelaar$Frequency), "ppois", lambda)


# }

Run the code above in your browser using DataLab