languageR (version 1.5.0)

durationsOnt: Durational measurements on the Dutch prefix ont-

Description

Durational measurements on the Dutch prefix ont- in the Spoken Dutch Corpus.

Usage

data(durationsOnt)

Format

A data frame with 102 observations on the following 12 variables.

Word

a factor with the words as levels.

Frequency

a numeric vector with the word's logarithmically transformed frequency in the Spoken Dutch Corpus.

Speaker

a factor with speakers as levels.

Sex

a factor with levels female and male.

YearOfBirth

a numeric vector coding the speaker's year of birth minus 1900.

DurationOfPrefix

a numeric vector for the duration of ont- in seconds.

DurationPrefixVowel

a numeric vector for the duration of the vowel in the prefix in seconds.

DurationPrefixNasal

a numeric vector for the duration of the nasal in the prefix in seconds.

DurationPrefixPlosive

a numeric vector for the duration of the plosive in the prefix in seconds.

NumberOfSegmentsOnset

a numeric vector for the number of segments in the onset of the stem.

PlosivePresent

a factor with levels no and yes for whether the plosive is realized in the signal.

SpeechRate

a numeric vector coding speech rate in number of syllables per second.
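
The variable descriptions above can be checked against the data directly. The following is a minimal sketch, assuming the languageR package is installed; the name birth_year is introduced here for illustration and is not part of the dataset. It prints the column types, tabulates PlosivePresent, and undoes the "minus 1900" coding of YearOfBirth.

```r
# Sketch only: runs when languageR is available, otherwise does nothing.
if (requireNamespace("languageR", quietly = TRUE)) {
  data(durationsOnt, package = "languageR")

  # Column types and a preview of the values in each variable
  str(durationsOnt)

  # How often the plosive is realized in the signal (levels no / yes)
  print(table(durationsOnt$PlosivePresent))

  # YearOfBirth is stored as year minus 1900, so add 1900 back
  # to obtain calendar years (birth_year is a hypothetical helper name)
  birth_year <- durationsOnt$YearOfBirth + 1900
  print(range(birth_year))
}
```

Keeping the decoded year in a separate vector leaves the data frame unchanged, so the models in the Examples section still see the original coding.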

References

Pluymaekers, M., Ernestus, M. and Baayen, R. H. (2005) Frequency and acoustic length: the case of derivational affixes in Dutch, Journal of the Acoustical Society of America, 118, 2561-2569.

Examples

# NOT RUN {
data(durationsOnt)

###### modeling the duration of the prefix

prefix.lm = lm(DurationOfPrefix ~ (YearOfBirth + SpeechRate) * Frequency, 
   data = durationsOnt)
summary(prefix.lm)

# ---- model criticism

plot(prefix.lm)
outliers = c(36, 35, 17, 72)
prefix.lm = lm(DurationOfPrefix ~ (YearOfBirth + SpeechRate) * Frequency, 
   data = durationsOnt[-outliers,])
summary(prefix.lm)

###### modeling the presence of the /t/ 

library(rms)
durationsOnt.dd = datadist(durationsOnt)
options(datadist = 'durationsOnt.dd')

plosive.lrm = lrm(PlosivePresent ~ SpeechRate + YearOfBirth, 
   data = durationsOnt, x = TRUE, y = TRUE)
plosive.lrm
validate(plosive.lrm, bw = TRUE, B = 200)

###### modeling the duration of the /n/

nasal.lm = lm(DurationPrefixNasal ~ PlosivePresent + Frequency + 
   YearOfBirth, data = durationsOnt)
summary(nasal.lm)

# ---- model criticism

plot(nasal.lm)
outliers = c(71, 28, 62, 33)
nasal.lm = lm(DurationPrefixNasal ~ PlosivePresent + Frequency + 
   YearOfBirth, data = durationsOnt[-outliers,])
summary(nasal.lm)

# }
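
In the example code, each refit overwrites the earlier model object, so the effect of dropping the four rows cannot be compared directly. The following is a minimal sketch, assuming languageR is installed, that keeps both fits of the prefix duration model and places their coefficients side by side. The names prefix.formula, prefix.full and prefix.trimmed are introduced here for illustration only; the row indices are the ones used in the example.

```r
# Sketch only: runs when languageR is available, otherwise does nothing.
if (requireNamespace("languageR", quietly = TRUE)) {
  data(durationsOnt, package = "languageR")

  # Same model formula as in the example
  prefix.formula <- DurationOfPrefix ~ (YearOfBirth + SpeechRate) * Frequency

  # Row indices removed in the example after inspecting the diagnostic plots
  outliers <- c(36, 35, 17, 72)

  # Fit once on all rows and once without the flagged rows
  prefix.full <- lm(prefix.formula, data = durationsOnt)
  prefix.trimmed <- lm(prefix.formula, data = durationsOnt[-outliers, ])

  # Coefficients of both fits in one table, rounded for readability
  print(round(cbind(full = coef(prefix.full),
                    trimmed = coef(prefix.trimmed)), 4))
}
```

The same pattern applies to the nasal duration model with its own set of row indices.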
