R interfaces to Weka tokenizers.
AlphabeticTokenizer(x, control = NULL)
NGramTokenizer(x, control = NULL)
WordTokenizer(x, control = NULL)
A character vector with the tokenized strings.
a character vector with strings to be tokenized.
an object of class Weka_control, or a character vector of control options, or NULL (default).
Available options can be obtained on-line using the Weka Option Wizard WOW, or from the Weka documentation.
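For instance, the options understood by a particular tokenizer can be listed interactively; a minimal sketch, assuming the RWeka package is installed and Weka is available:

```r
library(RWeka)

## Print the control options accepted by NGramTokenizer,
## e.g. its minimum and maximum n-gram size settings.
WOW("NGramTokenizer")
```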
AlphabeticTokenizer
is an alphabetic string tokenizer, where
tokens are formed only from contiguous alphabetic sequences.
NGramTokenizer
splits strings into \(n\)-grams with given
minimal and maximal numbers of grams.
WordTokenizer
is a simple word tokenizer.
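A short usage sketch of the three tokenizers, assuming RWeka is installed (the sample sentence is illustrative only):

```r
library(RWeka)

x <- "The quick brown fox can't jump over 2 lazy dogs."

## Simple word tokenization.
WordTokenizer(x)

## Tokens are formed only from contiguous alphabetic sequences,
## so "can't" is split into two tokens and "2" yields no token.
AlphabeticTokenizer(x)

## Word n-grams of lengths 2 up to 3, set via a Weka_control object.
NGramTokenizer(x, Weka_control(min = 2, max = 3))
```

Each call returns a character vector of tokens, as described under Value above.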