dfm_subset

Returns document subsets of a dfm that meet certain conditions,
including direct logical operations on docvars (document-level variables).
<code>dfm_subset</code> functions identically to <code><a href="/link/subset.data.frame()?package=quanteda&version=4.0.1" data-mini-rdoc="quanteda::subset.data.frame()">subset.data.frame()</a></code>,
using non-standard evaluation to evaluate conditions based on the
docvars in the dfm.

A fast, flexible, and comprehensive framework for
quantitative text analysis in R.  Provides functionality for corpus management,
creating and manipulating tokens and n-grams, exploring keywords in context,
forming and manipulating sparse matrices
of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and
distances, applying content dictionaries, applying supervised and unsupervised machine learning,
visually representing text and text analyses, and more.

Kenneth Benoit

quanteda

Quantitative Analysis of Textual Data

Kohei Watanabe

Haiyan Wang

Paul Nulty

Adam Obeng

Stefan Müller

Akitaka Matsuo

William Lowe

Christian Müller

Olivier Delmarcelle

European Research Council 

dfm_subset function

<dl><dt>x</dt>
<dd>dfm object to be subsetted.</dd>
<dt>subset</dt>
<dd>logical expression indicating the documents to keep: missing
values are taken as false.</dd>
<dt>min_ntoken, max_ntoken</dt>
<dd>minimum and maximum lengths of the documents to extract.</dd>
<dt>drop_docid</dt>
<dd>if <code>TRUE</code>, <code>docid</code> for documents are removed as the result
of subsetting.</dd>
<dt>...</dt>
<dd>not used</dd></dl>

Arguments

Returns document subsets of a dfm that meet certain conditions,
including direct logical operations on docvars (document-level variables).
<code>dfm_subset</code> functions identically to <code><a href='https://rdrr.io/r/base/subset.html'>subset.data.frame()</a></code>,
using non-standard evaluation to evaluate conditions based on the
docvars in the dfm.

Extract a subset of a dfm — dfm_subset

<dl>

<dt>x</dt>
<dd>dfm object to be subsetted.</dd>


<dt>subset</dt>
<dd>logical expression indicating the documents to keep: missing
values are taken as false.</dd>


<dt>min_ntoken, max_ntoken</dt>
<dd>minimum and maximum lengths of the documents to extract.</dd>


<dt>drop_docid</dt>
<dd>if <code>TRUE</code>, <code>docid</code> for documents are removed as the result
of subsetting.</dd>


<dt>...</dt>
<dd>not used</dd>

</dl>

dfm_subset: Extract a subset of a dfm

Description

Usage

Value

Arguments

Details

See Also

Examples