Method to trim and adjust objects by applying thresholds, minimum frequencies,
etc. It can be applied to context, features, partition and
partition_bundle objects.
A character vector with terms to exclude from the matrix
(terms used as stopwords).
docs_to_keep
A character vector with documents to keep.
min_count
A numeric value, the minimum total frequency of a term
across documents. Terms below this value are excluded from the matrix as rare terms.
min_doc_length
A numeric value, the minimum total number of tokens
in a document. Documents below this value are excluded, which
filters out short documents. Note that the min_doc_length filter is applied
before filtering for min_count and terms_to_keep. Because these later filters
reduce document lengths, a retained document may end up shorter than min_doc_length.
verbose
A logical value, whether to output progress messages.
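
The order of the filters described above can be illustrated with a small, language-neutral sketch in Python. It is not the package's implementation: the function trim_matrix, the stopwords parameter name, and the dict-of-counts representation are hypothetical and chosen only for illustration. The sketch also assumes that total term frequencies for min_count are computed over the documents that survive the min_doc_length filter.

```python
from collections import Counter


def trim_matrix(docs, min_doc_length=0, min_count=0, stopwords=()):
    """Hypothetical sketch of the filter order, not the package's code.

    docs maps a document name to a dict of term -> frequency.
    """
    # Step 1: drop short documents, measured on the untrimmed counts.
    kept = {
        name: terms
        for name, terms in docs.items()
        if sum(terms.values()) >= min_doc_length
    }

    # Step 2: total term frequency across the remaining documents
    # (assumption: totals are computed after step 1).
    totals = Counter()
    for terms in kept.values():
        totals.update(terms)
    rare = {term for term, n in totals.items() if n < min_count}

    # Step 3: remove rare terms and stopwords from every document.
    excluded = rare | set(stopwords)
    return {
        name: {t: n for t, n in terms.items() if t not in excluded}
        for name, terms in kept.items()
    }


docs = {
    "a": {"the": 3, "cat": 1, "sat": 1},  # 5 tokens
    "b": {"the": 1, "dog": 1},            # 2 tokens
    "c": {"the": 2, "cat": 2, "mat": 1},  # 5 tokens
}
trimmed = trim_matrix(docs, min_doc_length=4, min_count=2, stopwords=["the"])
```

In this example, document "b" is dropped by the length filter, while document "a" passes it with 5 tokens but is left with a single token once rare terms and stopwords are removed. This is the effect the note on min_doc_length warns about.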