quanteda (version 0.99.22)

tokens_replace: Replace types in a tokens object

Description

Substitute token types based on vectorized one-to-one matching. Because this function is intended for lemmatization or user-defined stemming, it does not support multi-word features or glob and regular expression patterns. For substitutions involving more complex patterns, use tokens_lookup with exclusive = FALSE.
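
As a minimal sketch of the alternative mentioned above: a dictionary passed to tokens_lookup with exclusive = FALSE substitutes tokens matching a glob pattern with the dictionary key while leaving unmatched tokens in place. The key "tax" and pattern "tax*" here are invented for illustration, and capkeys = FALSE is assumed to keep the key in lowercase.

toks <- tokens(data_corpus_irishbudget2010)
dict <- dictionary(list(tax = "tax*"))
# replace any token matching "tax*" with the key "tax", keeping all other tokens
toks_sub <- tokens_lookup(toks, dict, exclusive = FALSE, capkeys = FALSE)
kwic(toks_sub, "tax")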

Usage

tokens_replace(x, pattern, replacement = NULL, case_insensitive = TRUE,
  verbose = quanteda_options("verbose"))

Arguments

x

tokens object whose token elements will be replaced

pattern

a character vector or dictionary. See pattern for more details.

replacement

if pattern is a character vector, then replacement must be a character vector of equal length, for a 1:1 match (see the short example following these arguments). If pattern is a dictionary, then replacement should not be used.

case_insensitive

ignore case when matching, if TRUE

verbose

print status messages if TRUE
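
As a minimal sketch of the 1:1 matching and the case_insensitive option described above (the toy text and the replacement "levy" are invented for illustration):

toks_demo <- tokens(c(d1 = "The Tax rate and the tax base"))
# case_insensitive = TRUE (the default) replaces both "Tax" and "tax"
tokens_replace(toks_demo, "tax", "levy")
# case_insensitive = FALSE replaces only the exact form "tax"
tokens_replace(toks_demo, "tax", "levy", case_insensitive = FALSE)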

Examples

toks <- tokens(data_corpus_irishbudget2010)

# lemmatization
infle <- c("foci", "focus", "focused", "focuses", "focusing", "focussed", "focusses")
lemma <- rep("focus", length(infle))
toks2 <- tokens_replace(toks, infle, lemma)
kwic(toks2, "focus*")

# stemming
type <- types(toks)
stem <- char_wordstem(type, "porter")
toks3 <- tokens_replace(toks, type, stem, case_insensitive = FALSE)
identical(toks3, tokens_wordstem(toks, "porter"))
