cluster_frequency

Frequencies of an existing cluster object

It offers functions for splitting, parsing, tokenizing and creating a vocabulary for big text data files. Moreover, it includes functions for building a document-term matrix and extracting information from those (term-associations, most frequent terms). It also embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. Lastly, it includes functions for Word Vector Representations (i.e. 'GloVe', 'fasttext') and incorporates functions for the calculation of (pairwise) text document dissimilarities. The source code is based on 'C++11' and exported in R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.

Lampros Mouselimis

textTinyR

Text Processing for Small or Big Data Files

cluster_frequency function

<dl><dt>tokenized_list_text</dt>
<dd>a list of tokenized text documents. This can be the result of the textTinyR::tokenize_transform_vec_docs function with the as_token parameter set to TRUE (the token object of the output)</dd>
<dt>cluster_vector</dt>
<dd>a numeric vector. This can be the result of the ClusterR::KMeans_rcpp function (the clusters object of the output)</dd>
<dt>verbose</dt>
<dd>either TRUE or FALSE. If TRUE then information will be printed out in the R session.</dd></dl>

Arguments

Frequencies of an existing cluster object — cluster_frequency

<dl>

<dt>tokenized_list_text</dt>
<dd>a list of tokenized text documents. This can be the result of the textTinyR::tokenize_transform_vec_docs function with the as_token parameter set to TRUE (the token object of the output)</dd>


<dt>cluster_vector</dt>
<dd>a numeric vector. This can be the result of the ClusterR::KMeans_rcpp function (the clusters object of the output)</dd>


<dt>verbose</dt>
<dd>either TRUE or FALSE. If TRUE then information will be printed out in the R session.</dd>

</dl>

cluster_frequency: Frequencies of an existing cluster object

Description

Usage

Value

Arguments

Details

Examples