encoding

Detect the encoding of texts in a character vector, corpus, or readtext object and report
the most likely encoding for each document. This is useful for checking the
encoding of input texts, so that a source encoding can be (re)specified when
reading a set of texts with <code>readtext()</code>, prior to constructing
a corpus.
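
As a rough illustration of the detection step described above, the sketch below calls <code>stringi::stri_enc_detect()</code> directly, the function that receives the additional arguments of <code>encoding()</code>. The sample sentence and the variable names are made up for illustration, and the sketch assumes the stringi package is installed. Guesses on short inputs can be unreliable, so the reported name is a candidate, not a certainty.

```r
library(stringi)

# Hypothetical sample text, written with explicit Unicode escapes so the
# script itself stays ASCII-only.
utf8_text <- "Die Gr\u00f6\u00dfe der \u00c4pfel \u00fcberrascht die K\u00e4ufer im Gesch\u00e4ft t\u00e4glich."

# Re-encode to ISO-8859-1 bytes to mimic a file saved in a legacy encoding.
# With to_raw = TRUE the result is a list of raw vectors.
latin1_bytes <- stri_encode(utf8_text, from = "UTF-8", to = "ISO-8859-1",
                            to_raw = TRUE)

# stri_enc_detect() returns one data frame per input, with candidate
# encodings ordered by decreasing confidence.
guesses <- stri_enc_detect(latin1_bytes)
print(guesses[[1]])

# The first row holds the most likely encoding for this input.
best_guess <- guesses[[1]]$Encoding[1]
```

A name obtained this way could then be tried as the source encoding when the files are read again with <code>readtext()</code>.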

Functions for importing and handling text files and formatted text
files with additional meta-data, including '.csv', '.tab', '.json', '.xml',
'.html', '.pdf', '.doc', '.docx', '.rtf', '.xls', '.xlsx', and others.

Kenneth Benoit

readtext

Import and Handling for Plain and Formatted Text Files

Adam Obeng

Kohei Watanabe

Akitaka Matsuo

Paul Nulty

Stefan Müller

encoding function

Arguments

detect the encoding of texts — encoding

<dl>

<dt>x</dt>
<dd>character vector, corpus, or readtext object whose texts' encodings
will be detected.</dd>


<dt>verbose</dt>
<dd>if <code>FALSE</code>, do not print the diagnostic report</dd>


<dt>...</dt>
<dd>additional arguments passed to <a href='https://rdrr.io/pkg/stringi/man/stri_enc_detect.html'>stri_enc_detect</a></dd>

</dl>
