summary

summary,kRp.lang-method

summary,kRp.TTR-method

summary,kRp.readability-method

summary,kRp.text-method

An object of class, <code>kRp.lang</code>, <code>kRp.readability</code>, 
<code>kRp.text</code>, or <code>kRp.TTR</code>.

object

Further options, depending on the object class.

Logical, if <code>TRUE</code> and <code>feature="lex_div"</code> or <code>"readability"</code>,
 a named vector of main
results is returned. For objects containig more than one <code>doc_id</code>,
 defaults to <code>TRUE</code> automatically and
returns a data frame with named rows.

flat

Either a vector indicating which rows should be considered as transformed for the statistics,
or the name of a particular transformation that was previously done to the object,
 if more than one transformation was applied.
If <code>NA</code>, all rows where <code>"equal"</code> is <code>FALSE</code> are used.
Only valid for objects providing a <code>diff</code> feature.

index

A character string naming a feature present in the object,
 to trigger a summary regarding that feature.
Currently only <code>"freq"</code>, <code>"lex_div"</code>, and <code>"readability"</code> are implemented.

feature

Summary method for S4 objects of classes
<code>kRp.lang</code>,
<code>kRp.readability</code>,
<code>kRp.text</code>, or
<code>kRp.TTR</code>.

methods

A set of tools to analyze texts. Includes, amongst others, functions for
automatic language detection, hyphenation, several indices of lexical diversity
(e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch,
SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also
provided, to enable frequency analyses (supports Celex and Leipzig Corpora
Collection file formats) and measures like tf-idf. Note: For full functionality
a local installation of TreeTagger is recommended. It is also recommended to
not load this package directly, but by loading one of the available language
support packages from the 'l10n' repository
<https://undocumeantit.github.io/repos/l10n/>. 'koRpus' also includes a plugin
for the R GUI and IDE RKWard, providing graphical dialogs for its basic
features. The respective R package 'rkward' cannot be installed directly from a
repository, as it is a part of RKWard. To make full use of this feature, please
install RKWard from <https://rkward.kde.org> (plugins are detected
automatically). Due to some restrictions on CRAN, the full package sources are
only available from the project homepage. To ask for help, report bugs, request
features, or discuss the development of the package, please subscribe to the
koRpus-dev mailing list (<https://korpusml.reaktanz.de>).

Meik Michalke

koRpus

Text Analysis with Emphasis on POS Tagging, Readability, and
Lexical Diversity

Earl Brown

Alberto Mirisola

Alexandre Brulet

Laura Hauser

summary function

<dl><dt>object</dt>
<dd>An object of class, <code>kRp.lang</code>, <code>kRp.readability</code>, 
<code>kRp.text</code>, or <code>kRp.TTR</code>.</dd>
<dt>...</dt>
<dd>Further options, depending on the object class.</dd>
<dt>flat</dt>
<dd>Logical, if <code>TRUE</code> and <code>feature="lex_div"</code> or <code>"readability"</code>,
 a named vector of main
results is returned. For objects containig more than one <code>doc_id</code>,
 defaults to <code>TRUE</code> automatically and
returns a data frame with named rows.</dd>
<dt>index</dt>
<dd>Either a vector indicating which rows should be considered as transformed for the statistics,
or the name of a particular transformation that was previously done to the object,
 if more than one transformation was applied.
If <code>NA</code>, all rows where <code>"equal"</code> is <code>FALSE</code> are used.
Only valid for objects providing a <code>diff</code> feature.</dd>
<dt>feature</dt>
<dd>A character string naming a feature present in the object,
 to trigger a summary regarding that feature.
Currently only <code>"freq"</code>, <code>"lex_div"</code>, and <code>"readability"</code> are implemented.</dd></dl>

Arguments

Summary methods for koRpus objects — summary

<dl>

<dt>object</dt>
<dd>An object of class, <code>kRp.lang</code>, <code>kRp.readability</code>, 
<code>kRp.text</code>, or <code>kRp.TTR</code>.</dd>


<dt>...</dt>
<dd>Further options, depending on the object class.</dd>


<dt>flat</dt>
<dd>Logical, if <code>TRUE</code> and <code>feature="lex_div"</code> or <code>"readability"</code>,
 a named vector of main
results is returned. For objects containig more than one <code>doc_id</code>,
 defaults to <code>TRUE</code> automatically and
returns a data frame with named rows.</dd>


<dt>index</dt>
<dd>Either a vector indicating which rows should be considered as transformed for the statistics,
or the name of a particular transformation that was previously done to the object,
 if more than one transformation was applied.
If <code>NA</code>, all rows where <code>"equal"</code> is <code>FALSE</code> are used.
Only valid for objects providing a <code>diff</code> feature.</dd>


<dt>feature</dt>
<dd>A character string naming a feature present in the object,
 to trigger a summary regarding that feature.
Currently only <code>"freq"</code>, <code>"lex_div"</code>, and <code>"readability"</code> are implemented.</dd>

</dl>

summary: Summary methods for koRpus objects

Description

Usage

Arguments

See Also

Examples