languageR (version 1.5.0)

durationsGe: Durational measurements on the Dutch prefix ge-

Description

Durational measurements on the Dutch prefix ge- in the Spoken Dutch Corpus.

Usage

data(durationsGe)

Format

A data frame with 428 observations on the following 8 variables.

Word

a factor with the words as levels.

Frequency

a numeric vector with the word's absolute frequency in the Spoken Dutch Corpus.

Speaker

a factor with the speakers as levels.

Sex

a factor with levels female and male; this information is missing for one speaker.

YearOfBirth

a numeric vector with years of birth.

DurationOfPrefix

a numeric vector with the duration of the prefix ge- in seconds.

SpeechRate

a numeric vector coding speech rate in number of syllables per second.

NumberSegmentsOnset

a numeric vector for the number of segments in the onset of the stem.
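
A quick way to check the structure described above is to load the data and inspect it. This is a minimal sketch, assuming the languageR package is installed; the `requireNamespace()` guard is an addition for illustration and is not part of the original help page.

```r
# Sketch only: assumes languageR is installed; does nothing otherwise.
if (requireNamespace("languageR", quietly = TRUE)) {
  data(durationsGe, package = "languageR")

  # Dimensions and column types, to compare with the Format section
  print(dim(durationsGe))
  str(durationsGe)

  # Tabulate Sex, showing missing values if any are coded as NA
  print(table(durationsGe$Sex, useNA = "ifany"))

  # Distribution of the response variable (seconds)
  print(summary(durationsGe$DurationOfPrefix))
}
```
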

References

Pluymaekers, M., Ernestus, M. and Baayen, R. H. (2005) Frequency and acoustic length: the case of derivational affixes in Dutch, Journal of the Acoustical Society of America, 118, 2561-2569.

Examples

# NOT RUN {
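    # load the data set (requires the languageR package to be attached)
    data(durationsGe)
    # log-transform frequency; adding 1 keeps zero counts defined
    durationsGe$Frequency = log(durationsGe$Frequency + 1)
    # express year of birth as years since 1900
    durationsGe$YearOfBirth = durationsGe$YearOfBirth - 1900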

    durationsGe.lm = lm(DurationOfPrefix ~ Frequency+SpeechRate, data = durationsGe)
    summary(durationsGe.lm)

    # ---- model criticism
    
    # standard lm diagnostic plots
    plot(durationsGe.lm)
    # rows treated as outliers and excluded from the refit
    outliers = c(271, 392, 256, 413, 118)
    durationsGe.lm = lm(DurationOfPrefix ~ Frequency + SpeechRate, 
      data = durationsGe[-outliers, ])
    summary(durationsGe.lm)
  
# }
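
The example above removes a hard-coded set of rows after inspecting the diagnostic plots. As a hedged alternative, outlier candidates can be flagged programmatically from standardized residuals. The helper `flag_outliers()` and the cutoff of 2.5 are hypothetical choices made for this sketch; they are not part of languageR and are not the criterion used to pick the rows listed above.

```r
# Hypothetical helper: returns the row names of observations whose
# standardized residual exceeds `cutoff` in absolute value.
flag_outliers <- function(fit, cutoff = 2.5) {
  std_res <- rstandard(fit)
  names(std_res)[!is.na(std_res) & abs(std_res) > cutoff]
}

# Usage sketch: assumes languageR is installed; does nothing otherwise.
if (requireNamespace("languageR", quietly = TRUE)) {
  data(durationsGe, package = "languageR")
  durationsGe$Frequency <- log(durationsGe$Frequency + 1)

  fit <- lm(DurationOfPrefix ~ Frequency + SpeechRate, data = durationsGe)
  flagged <- flag_outliers(fit)
  print(flagged)

  # Refit without the flagged rows, matching on row names
  trimmed <- durationsGe[!(rownames(durationsGe) %in% flagged), ]
  fit_trimmed <- lm(DurationOfPrefix ~ Frequency + SpeechRate, data = trimmed)
  print(summary(fit_trimmed))
}
```

Matching on row names rather than positions keeps the selection correct even if `lm()` drops rows with missing values before fitting.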