
Find frequent terms in a document-term or term-document matrix.
findFreqTerms(x, lowfreq = 0, highfreq = Inf)
A character vector of terms in x
which occur more or equal often
than lowfreq
times and less or equal often than highfreq
times.
A DocumentTermMatrix
or
TermDocumentMatrix
.
A numeric for the lower frequency bound.
A numeric for the upper frequency bound.
This method works for all numeric weightings but is probably
most meaningful for the standard term frequency (tf
) weighting
of x
.
data("crude")
tdm <- TermDocumentMatrix(crude)
findFreqTerms(tdm, 2, 3)
Run the code above in your browser using DataLab