quanteda (version 0.9.9-50)

corpus_subset: extract a subset of a corpus

Description

Returns the subset of a corpus whose documents meet the given conditions, which may include logical operations applied directly to docvars (document-level variables). corpus_subset works like subset.data.frame: it uses non-standard evaluation, so conditions are written in terms of the docvar names of the corpus.

Usage

corpus_subset(x, subset, select, ...)

Arguments

x
corpus object to be subsetted
subset
logical expression indicating the documents to keep; missing values are treated as FALSE
select
expression indicating the docvars to keep in the returned corpus
...
not used
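
Because the subset and select arguments follow subset.data.frame, their semantics can be sketched with base R alone. The data frame below (docvars_demo) is a hypothetical stand-in for the docvars of a corpus, not real quanteda data; it only illustrates how a condition is evaluated against column names and how a missing value is handled.

```r
# Hypothetical table standing in for the document-level variables of a corpus.
docvars_demo <- data.frame(
  Year = c(1933, 1937, 1981, NA),
  President = c("Roosevelt", "Roosevelt", "Reagan", "Unknown"),
  stringsAsFactors = FALSE
)

# The condition refers to columns by bare name (non-standard evaluation);
# select keeps only the named column.
kept <- subset(docvars_demo, Year > 1930 & President == "Roosevelt",
               select = Year)
print(kept)

# The row with a missing Year yields NA in the condition and is dropped,
# since NA is treated as FALSE.
recent <- subset(docvars_demo, Year > 1980)
print(recent)
```

Under this assumption, corpus_subset would apply the same kind of condition to the docvars and keep the matching documents.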

Value

corpus object, with a subset of documents (and docvars) selected according to arguments

See Also

subset.data.frame

Examples

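library(quanteda)
# documents with Year after 1980
summary(corpus_subset(data_corpus_inaugural, Year > 1980))
# Roosevelt's documents with Year after 1930, keeping only the Year docvar
summary(corpus_subset(data_corpus_inaugural, Year > 1930 & President == "Roosevelt",
                      select = Year))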