Learn R Programming

quanteda (version 4.0.1)

ndoc: Count the number of documents or features

Description

Get the number of documents or features in an object.

Usage

ndoc(x)

nfeat(x)

Value

ndoc() returns an integer count of the number of documents in an object whose texts are organized as "documents" (a corpus, dfm, or tokens/tokens_xptr object.

nfeat() returns an integer count of the number of features. It is an alias for ntype() for a dfm. This function is only defined for dfm

objects because only these have "features".

Arguments

x

a quanteda object: a corpus, dfm, tokens, or tokens_xptr object, or a readtext object from the readtext package

See Also

ntoken(), ntype()

Examples

Run this code
# number of documents
ndoc(data_corpus_inaugural)
ndoc(corpus_subset(data_corpus_inaugural, Year > 1980))
ndoc(tokens(data_corpus_inaugural))
ndoc(dfm(tokens(corpus_subset(data_corpus_inaugural, Year > 1980))))

# number of features
toks1 <- tokens(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = FALSE)
toks2 <- tokens(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = TRUE)
nfeat(dfm(toks1))
nfeat(dfm(toks2))

Run the code above in your browser using DataLab