Learn R Programming

mallet (version 1.3.0)

An R Wrapper for the Java Mallet Topic Modeling Toolkit

Description

An R interface for the Java Machine Learning for Language Toolkit (mallet) to estimate probabilistic topic models, such as Latent Dirichlet Allocation. We can use the R package to read textual data into mallet from R objects, run the Java implementation of mallet directly in R, and extract results as R objects. The Mallet toolkit has many functions, this wrapper focuses on the topic modeling sub-package written by David Mimno. The package uses the rJava package to connect to a JVM.

Copy Link

Version

Install

install.packages('mallet')

Version

1.3.0

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Last Published

July 20th, 2022

Functions in mallet (1.3.0)

MalletLDA

Create a Mallet topic model trainer
mallet_stoplist_file_path

Return the file path to the mallet stoplists
load.mallet.state

Load a Mallet state into Mallet
mallet_supported_stoplists

Mallet supported stoplists
mallet.doc.topics

Retrieve a matrix of topic weights for every document
save.mallet.state

Save a Mallet state to file
mallet-package

An R Wrapper for the Java Mallet Topic Modeling Toolkit
save.mallet.instances

Load and save mallet instances from/to file
mallet.read.dir

Import documents from a directory into Mallet format
mallet.import

Import text documents into Mallet format
mallet.topic.model.read

Load (read) and save (write) a topic from a file
mallet.topic.labels

Get strings containing the most probable words for each topic
mallet.topic.hclust

Return a hierarchical clustering of topics
mallet.subset.topic.words

Estimate topic-word distributions from a sub-corpus
mallet_jar

Return the mallet jar filename(s)
mallet.word.freqs

Descriptive statistics of word frequencies
mallet.top.words

Get the most probable words and their probabilities for one topic
sotu

State of the Union Adresses.
mallet.topic.words

Retrieve a matrix of words weights for topics