Returns subsets of a corpus that meet certain conditions, including direct
logical operations on docvars (document-level variables). corpus_subset
functions identically to subset.data.frame, using non-standard
evaluation to evaluate conditions based on the docvars in the corpus.
Usage
corpus_subset(x, subset, select, ...)
Arguments
x
corpus object to be subsetted
subset
logical expression indicating elements or rows to keep: missing
values are taken as false
select
expression, indicating the attributes to select from the corpus
...
not used
Value
corpus object, with a subset of documents (and docvars) selected according to arguments
summary(corpus_subset(data_corpus_inaugural, Year > 1980))
summary(corpus_subset(data_corpus_inaugural, Year > 1930 & President == "Roosevelt",
select = Year))