Arguments
corpus
Either the path to directory with txt files
to read and analyze, or a vector object already holding
the text corpus. Can also be an already tokenized and
tagged text object which inherits class kRp.tagged
(then the column "toke
format
Either "file" or "obj", depending on
whether you want to scan files or analyze the given
object.
fileEncoding
A character string naming the
encoding of the corpus files.
quiet
Logical. If FALSE
, short status
messages will be shown.
caseSens
Logical. If FALSE
, all tokens will
be matched in their lower case form.
...
Additional options to be passed through to the
tokenize
function.