Most frequent words of the corpus.
frequentwords(corpus, nb, mincount = 5, minphrasecount = NULL, ngram = 1, lang = "en", stopwords = lang)
The most frequent words of the corpus.
corpus: The corpus of documents (a character vector) or the vocabulary of the documents (the result of the function getvocab).
nb: The number of words to be returned.
mincount: Minimum number of occurrences for a word to be considered frequent.
minphrasecount: Minimum number of occurrences for a collocation of words (a phrase) to be considered frequent.
ngram: Maximum size of n-grams.
lang: The language of the documents (NULL to disable stemming).
stopwords: The stop words, or the language of the documents. NULL if stop words should not be removed.
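The way nb, mincount and stopwords interact can be illustrated with a small base-R sketch. This is not the package's implementation: top_words is a hypothetical helper introduced here for illustration only, and it assumes simple lower-casing and splitting on non-alphanumeric characters, with no stemming and no n-grams.

```r
# Hypothetical helper (not part of the package): a base-R approximation of
# "most frequent words" using unigram counts only.
top_words <- function(docs, nb, mincount = 5, stopwords = character(0)) {
  # Lower-case, then split on anything that is not a letter, digit or apostrophe
  tokens <- unlist(strsplit(tolower(docs), "[^a-z0-9']+"))
  # Drop empty tokens and stop words
  tokens <- tokens[nzchar(tokens) & !(tokens %in% stopwords)]
  # Count occurrences and sort from most to least frequent
  counts <- sort(table(tokens), decreasing = TRUE)
  # Keep only words that occur at least mincount times
  counts <- counts[counts >= mincount]
  # Return at most nb words
  head(names(counts), nb)
}

docs <- c("the cat sat on the mat", "the dog sat on the log", "a cat and a dog")
res <- top_words(docs, nb = 3, mincount = 2, stopwords = c("the", "a", "on", "and"))
print(res)
```

In this toy corpus only "cat", "dog" and "sat" occur at least twice once the stop words are removed, so raising mincount or lowering nb shrinks the result.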
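if (FALSE) {
  # Not run: downloads a large corpus
  text = loadtext("http://mattmahoney.net/dc/text8.zip")
  # 100 most frequent words, computed from the raw text
  frequentwords(text, 100)
  # The same call on a precomputed vocabulary
  vocab = getvocab(text)
  frequentwords(vocab, 100)
}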