Learn R Programming

koRpus (version 0.04-40)

read.corp.celex: Import Celex data

Description

Read data from Celex[1] formatted corpora.

Usage

read.corp.celex(celex.path, running.words,
    fileEncoding = "ISO_8859-1", n = -1)

Arguments

celex.path
A character string, path to a frequency file in Celex format to read.
running.words
An integer value, number of running words in the Celex data corpus to be read.
fileEncoding
A character string naming the encoding of the Celex files.
n
An integer value defining how many lines of data should be read if format="flatfile". Reads all at -1.

Value

References

[1] http://celex.mpi.nl

See Also

kRp.corp.freq-class

Examples

Run this code
my.Celex.data <- read.corp.celex("~/mydata/Celex/GERMAN/GFW/GFW.CD", running.words=5952000)
freq.analysis("/some/text.txt", corp.freq=my.Celex.data)

Run the code above in your browser using DataLab