Learn R Programming

SpeedReader (version 0.9.1)

speed_set_vocabulary: A function the reorgaizes vocabulary to speed up document term matrix formation using a string stem dictionary.

Description

A function the reorgaizes vocabulary to speed up document term matrix formation using a string stem dictionary.

Usage

speed_set_vocabulary(vocab, term_frequency_threshold = 0, cores = 1)

Arguments

vocab

A vocabulary list object returned by the count_words() function.

term_frequency_threshold

A threshold below which all words appearing fewer than that many times in the corpus will be removed. Defaults to 0 in which case no words will be removed.

cores

The number of cores we wish to use for parallelization to speed up computation. Defaults to 1.

Value

A vocabulary list object.