read.corp.custom

Either the path to directory with txt files
  to read and analyze, or a vector object already holding
  the text corpus.  Can also be an already tokenized and
  tagged text object which inherits class <code>kRp.tagged</code>
  (then the column <code>"toke</code>

corpus

Either "file" or "obj", depending on
  whether you want to scan files or analyze the given
  object.

format

A character string naming the
  encoding of the corpus files.

fileEncoding

Logical. If <code>FALSE</code>, short status
  messages will be shown.

quiet

Logical. If <code>FALSE</code>, all tokens will
  be matched in their lower case form.

caseSens

Additional options to be passed through to the
  <code>tokenize</code> function.

Read data from a custom corpus into a valid object of
  class <code><a href="/link/kRp.corp.freq-class?package=koRpus&version=0.04-40&to=koRpus" rd-options="koRpus" data-mini-rdoc="koRpus::kRp.corp.freq-class">kRp.corp.freq-class</a></code>.

corpora

A set of tools to analyze texts. Includes, amongst others,
        functions for automatic language detection, hyphenation,
        several indices of lexical diversity (e.g., type token ratio,
        HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX,
        Dale-Chall). Basic import functions for language corpora are
        also provided, to enable frequency analyses (supports Celex and
        Leipzig Corpora Collection file formats).  #' Note: For full
        functionality a local installation of TreeTagger is
        recommended.  Be encouraged to send feedback to the author(s)!

read.corp.custom: Import custom corpus data

Description

Usage

Arguments

Value

Details

See Also

Examples