Learn R Programming

quanteda (version 1.3.13)

data-internal: Internal data sets

Description

Data sets used for mainly internal purposes by the quanteda package.

Usage

data_int_syllables

data_char_wordlists

Arguments

Format

An object of class integer of length 133245.

Details

data_int_syllables provides an English-language syllables dictionary; it is an integer vector whose element names correspond to English words.

data_char_wordlists provides word lists used in some readability indexes; it is a named list of character vectors where each list element corresponds to a different readability index.

These are:

DaleChall

The long Dale-Chall list of 3,000 familiar (English) words needed to compute the Dale-Chall Readability Formula.

Spache

The revised Spache word list (see Klare 1975, 73) needed to compute the Spache Revised Formula of readability (Spache 1974.

References

Chall, J. S., & Dale, E. 1995. Readability Revisited: The New Dale-Chall Readability Formula. Brookline Books.

Dale, Edgar, and Jeanne Sternlicht Chall. 1948. "A Formula for Predicting Readability". Educational Research Bulletin 27(1): 11-20.

Dale, Edgar, and Jeanne S Chall. 1948. "A Formula for Predicting Readability: Instructions." Educational Research Bulletin 27(2): 37<U+2013>54.

Klare, G. R. 1975. "Assessing readability." Reading Research Quarterly 10(1): 62-102.

Spache, G. 1953. "A new readability formula for primary-grade reading materials." The Elementary School Journal 53: 410-413.