A function that reorganizes the vocabulary to speed up document-term matrix formation using a string stem dictionary.
speed_set_vocabulary(vocab, term_frequency_threshold = 0, cores = 1)
vocab: A vocabulary list object returned by the count_words() function.
term_frequency_threshold: A frequency threshold. Words that appear fewer than this many times in the corpus are removed. Defaults to 0, in which case no words are removed.
cores: The number of cores to use for parallelization. Defaults to 1.
A vocabulary list object.
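The effect of term_frequency_threshold can be illustrated with a minimal base R sketch. It does not reproduce the package's implementation: the field names unique_words and word_counts and the helper filter_vocabulary() are assumptions made for illustration, and the object returned by count_words() may be structured differently.

```r
# Hypothetical stand-in for a vocabulary: parallel vectors of terms and counts.
# The field names are assumptions, not the documented structure.
vocab <- list(
  unique_words = c("the", "model", "zygote", "tensor"),
  word_counts  = c(120, 14, 1, 3)
)

# Hypothetical helper: keep only terms that appear at least `threshold` times,
# i.e. drop terms that appear fewer than `threshold` times.
filter_vocabulary <- function(vocab, threshold = 0) {
  keep <- vocab$word_counts >= threshold
  list(
    unique_words = vocab$unique_words[keep],
    word_counts  = vocab$word_counts[keep]
  )
}

# With a threshold of 3, "zygote" (count 1) is dropped.
filtered <- filter_vocabulary(vocab, threshold = 3)
print(filtered$unique_words)
```

With the default threshold of 0 every count passes the comparison, so the vocabulary is returned unchanged, which matches the documented default behavior.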