doc_counts

Computes some basic document counts (see the 'Value' section below for
details).
The function is vectorized by document, and scores are computed in parallel
via OpenMP. You can control the number of threads used with the
<code>nthreads</code> parameter.

An English language syllable counter, plus readability score
measure-er. For readability, we support 'Flesch' Reading Ease and
'Flesch-Kincaid' Grade Level ('Kincaid' 'et al'. 1975)
<https://stars.library.ucf.edu/cgi/viewcontent.cgi?article=1055&context=istlibrary>,
Automated Readability Index ('Senter' and Smith 1967)
<https://apps.dtic.mil/sti/citations/AD0667273>,
Simple Measure of Gobbledygook (McLaughlin 1969),
and 'Coleman-Liau' (Coleman and 'Liau' 1975) <doi:10.1037/h0076540>. The
package has been carefully optimized and should be very efficient, both in
terms of run time performance and memory consumption. The main methods are
'vectorized' by document, and scores for multiple documents are computed in
parallel via 'OpenMP'.

Drew Schmidt

sylcount

Syllable Counting and Readability Measurements

doc_counts function

<dl><dt>s</dt>
<dd>A character vector (vector of strings).</dd>
<dt>nthreads</dt>
<dd>Number of threads to use. By default it will use the total number of
cores + hyperthreads.</dd></dl>

Arguments

doc_counts — doc_counts

<dl>

<dt>s</dt>
<dd>A character vector (vector of strings).</dd>


<dt>nthreads</dt>
<dd>Number of threads to use. By default it will use the total number of
cores + hyperthreads.</dd>

</dl>

`chars`	the total numberof characters
`wordchars`	the number of alphanumeric characters
`words`	text tokens that are probably English language words
`nonwords`	text tokens that are probably not English language words
`sents`	the number of sentences recognized in the text
`sylls`	the total number of syllables (ignores all non-words)
`polys`	the total number of "polysyllables", or words with 3+ syllables

doc_counts: doc_counts

Description

Usage

Value

Arguments

Details

See Also

Examples