Returns subsets of a corpus that meet certain conditions, including direct
logical operations on docvars (document-level variables). corpus_subset
functions identically to subset.data.frame(), using non-standard
evaluation to evaluate conditions based on the docvars in the corpus.
Usage
corpus_subset(x, subset, drop_docid = TRUE, ...)
Value
corpus object, with a subset of documents (and docvars) selected according to arguments
Arguments
x
corpus object to be subsetted.
subset
logical expression indicating the documents to keep: missing
values are taken as false.
drop_docid
if TRUE, docid for documents are removed as the result
of subsetting.