tokens_lookup

tokens object to which dictionary or thesaurus will be supplied

the <a rd-options="" href="/link/dictionary?package=quanteda&version=0.99.12" data-mini-rdoc="quanteda::dictionary">dictionary</a>-class object that will be applied to 
<code>x</code>

dictionary

integers specifying the levels of entries in a hierarchical 
dictionary that will be applied. The top level is 1, and subsequent levels
describe lower nesting levels. Values may be combined, even if these 
levels are not contiguous, e.g. `levels = c(1:3)` will collapse the second 
level into the first, but record the third level (if present) collapsed
below the first. (See examples.)

levels

the type of pattern matching: <code>"glob"</code> for 
"glob"-style wildcard expressions; <code>"regex"</code> for regular expressions;
or <code>"fixed"</code> for exact matching. See <a rd-options="" href="/link/valuetype?package=quanteda&version=0.99.12" data-mini-rdoc="quanteda::valuetype">valuetype</a> for details.

valuetype

ignore the case of dictionary values if <code>TRUE</code> 
uppercase to distinguish them from other features

case_insensitive

if TRUE, convert dictionary keys to uppercase to distinguish 
them from other features

capkeys

if <code>TRUE</code>, remove all features not in dictionary, 
otherwise, replace values in dictionary with keys while leaving other 
features unaffected

exclusive

an optional character naming a new key for tokens that do not
matched to a dictionary values If <code>NULL</code> (default), do not record
unmatched tokens.

nomatch

print status messages if <code>TRUE</code>

verbose

Convert tokens into equivalence classes defined by values of a dictionary 
object.

A fast, flexible, and comprehensive framework for
quantitative text analysis in R.  Provides functionality for corpus management,
creating and manipulating tokens and ngrams, exploring keywords in context,
forming and manipulating sparse matrices
of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and
distances, applying content dictionaries, applying supervised and unsupervised machine learning,
visually representing text and text analyses, and more.

Kenneth Benoit

quanteda

Quantitative Analysis of Textual Data

Kohei Watanabe

Paul Nulty

Adam Obeng

Haiyan Wang

Benjamin Lauderdale

Will Lowe

tokens_lookup function

the <a rd-options='' href='dictionary'>dictionary</a>-class object that will be applied to 
<code>x</code>

the type of pattern matching: <code>"glob"</code> for 
"glob"-style wildcard expressions; <code>"regex"</code> for regular expressions;
or <code>"fixed"</code> for exact matching. See <a rd-options='' href='valuetype'>valuetype</a> for details.

tokens_lookup: apply a dictionary to a tokens object

Description

Usage

Arguments

Examples