lex.div.num

num.tokens

num.types

A character vector defining the measures to calculate.

measure

A numeric value defining the base of the logarithm. See <code><a rd-options="base:log" href="/link/log?package=koRpus&version=0.10-2&to=base%3Alog" data-mini-rdoc="base:log::log">log</a></code> for details.

log.base

Logical. If <code>FALSE</code>, short status messages will be shown.
<code>TRUE</code> will also suppress all potential warnings regarding the validation status of measures.

quiet

This function is a stripped down version of <code><a rd-options="koRpus:lex.div" href="/link/lex.div?package=koRpus&version=0.10-2&to=koRpus%3Alex.div" data-mini-rdoc="koRpus:lex.div::lex.div">lex.div</a></code>. It does not analyze text,
but takes the numbers of tokens and types directly to calculate measures for which this information is sufficient:<ul>
<li><code>"TTR"</code>The classic Type-Token Ratio</li>
<li><code>"C"</code>Herdan's C</li>
<li><code>"R"</code>Guiraud's Root TTR</li>
<li><code>"CTTR"</code>Carroll's Corrected TTR</li>
<li><code>"U"</code>Dugast's Uber Index</li>
<li><code>"S"</code>Summer's index</li>
<li><code>"Maas"</code> Maas' (\(a^2\))</li>
</ul>See <code><a rd-options="koRpus:lex.div" href="/link/lex.div?package=koRpus&version=0.10-2&to=koRpus%3Alex.div" data-mini-rdoc="koRpus:lex.div::lex.div">lex.div</a></code> for further details on the formulae.

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation,
several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG,
LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports
Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Support for additional languages can be
added on-the-fly or by plugin packages. Note: For full functionality a local installation of TreeTagger is recommended.
'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The
respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full
use of this feature, please install RKWard from <https://rkward.kde.org> (plugins are detected automatically). Due to
some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help,
report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing
list (<http://korpusml.reaktanz.de>).

Meik Michalke

koRpus

An R Package for Text Analysis

m.eik michalke

Earl Brown

Alberto Mirisola

Alexandre Brulet

Laura Hauser

lex.div.num function

A numeric value defining the base of the logarithm. See <code><a rd-options='base:log' href='log'>log</a></code> for details.

This function is a stripped down version of <code><a rd-options='koRpus:lex.div' href='lex.div'>lex.div</a></code>. It does not analyze text,
but takes the numbers of tokens and types directly to calculate measures for which this information is sufficient:<ul>
<li><code>"TTR"</code>The classic Type-Token Ratio</li>
<li><code>"C"</code>Herdan's C</li>
<li><code>"R"</code>Guiraud's Root TTR</li>
<li><code>"CTTR"</code>Carroll's Corrected TTR</li>
<li><code>"U"</code>Dugast's Uber Index</li>
<li><code>"S"</code>Summer's index</li>
<li><code>"Maas"</code> Maas' (\(a^2\))</li>
</ul>See <code><a rd-options='koRpus:lex.div' href='lex.div'>lex.div</a></code> for further details on the formulae.

lex.div.num: Calculate lexical diversity

Description

Usage

Arguments

Value

References

See Also

Examples