powered by
Read data from Celex[1] formatted corpora.
read.corp.celex(celex.path, running.words, fileEncoding = "ISO_8859-1", n = -1)
A character string, path to a frequency file in Celex format to read.
An integer value, number of running words in the Celex data corpus to be read.
A character string naming the encoding of the Celex files.
An integer value defining how many lines of data should be read if format="flatfile". Reads all at -1.
format="flatfile"
An object of class kRp.corp.freq-class.
kRp.corp.freq-class
[1] http://celex.mpi.nl
# NOT RUN { my.Celex.data <- read.corp.celex("~/mydata/Celex/GERMAN/GFW/GFW.CD", running.words=5952000) freq.analysis("/some/text.txt", corp.freq=my.Celex.data) # }
Run the code above in your browser using DataLab