tm (version 0.7-18)

Text Mining Package

Description

A framework for text mining applications within R.

Install

install.packages('tm')

Monthly Downloads

39,756

Version

0.7-18

License

GPL-3

Maintainer

Kurt Hornik

Last Published

February 18th, 2026

Functions in tm (0.7-18)

inspect: Inspect Objects
hpc: Parallelized ‘lapply’
readPlain: Read In a Text Document
readDataframe: Read In a Text Document from a Data Frame
findAssocs: Find Associations in a Term-Document Matrix
content_transformer: Content Transformers
findMostFreqTerms: Find Most Frequent Terms
readTagged: Read In a POS-Tagged Word Text Document
readReut21578XML: Read In a Reuters-21578 XML Document
stripWhitespace: Strip Whitespace from a Text Document
readDOC: Read In a MS Word Document
plot: Visualize a Term-Document Matrix
readRCV1: Read In a Reuters Corpus Volume 1 Document
removeNumbers: Remove Numbers from a Text Document
removePunctuation: Remove Punctuation Marks from a Text Document
readXML: Read In an XML Document
findFreqTerms: Find Frequent Terms
termFreq: Term Frequency Vector
crude: 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readPDF: Read In a PDF Document
weightTfIdf: Weight by Term Frequency - Inverse Document Frequency
removeSparseTerms: Remove Sparse Terms from a Term-Document Matrix
foreign: Read Document-Term Matrices
weightSMART: SMART Weightings
weightTf: Weight by Term Frequency
removeWords: Remove Words from a Text Document
TermDocumentMatrix: Term-Document Matrix
meta: Metadata Management
stemDocument: Stem Words
tm_filter: Filter and Index Functions on Corpora
tm_map: Transformations on Corpora
writeCorpus: Write a Corpus to Disk
stopwords: Stopwords
tm_term_score: Compute Score for Matching Terms
stemCompletion: Complete Stems
tm_reduce: Combine Transformations
tokenizer: Tokenizers
weightBin: Weight Binary
PlainTextDocument: Plain Text Documents
PCorpus: Permanent Corpora
Reader: Readers
DataframeSource: Data Frame Source
Corpus: Corpora
SimpleCorpus: Simple Corpora
Source: Sources
DirSource: Directory Source
TextDocument: Text Documents
Docs: Access Document IDs and Terms
URISource: Uniform Resource Identifier Source
XMLSource: XML Source
tm_combine: Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
WeightFunction: Weighting Function
VectorSource: Vector Source
acq: 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
VCorpus: Volatile Corpora
XMLTextDocument: XML Text Documents
getTransformations: Transformations
getTokenizers: Tokenizers
Zipf_n_Heaps: Explore Corpus Term Frequency Characteristics
ZipSource: ZIP File Source