tm (version 0.7-16)
Text Mining Package
Description
A framework for text mining applications within R.
Install
install.packages('tm')
Monthly Downloads
42,308
Version
0.7-16
License
GPL-3
Maintainer
Kurt Hornik
Last Published
February 19th, 2025
Functions in tm (0.7-16)
URISource: Uniform Resource Identifier Source
WeightFunction: Weighting Function
VectorSource: Vector Source
VCorpus: Volatile Corpora
XMLSource: XML Source
tm_combine: Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
ZipSource: ZIP File Source
acq: 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
XMLTextDocument: XML Text Documents
Zipf_n_Heaps: Explore Corpus Term Frequency Characteristics
inspect: Inspect Objects
findFreqTerms: Find Frequent Terms
foreign: Read Document-Term Matrices
getTokenizers: Tokenizers
getTransformations: Transformations
findMostFreqTerms: Find Most Frequent Terms
hpc: Parallelized ‘lapply’
findAssocs: Find Associations in a Term-Document Matrix
content_transformer: Content Transformers
crude: 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readPlain: Read In a Text Document
readRCV1: Read In a Reuters Corpus Volume 1 Document
readReut21578XML: Read In a Reuters-21578 XML Document
readDOC: Read In a MS Word Document
plot: Visualize a Term-Document Matrix
readTagged: Read In a POS-Tagged Word Text Document
readDataframe: Read In a Text Document from a Data Frame
TermDocumentMatrix: Term-Document Matrix
meta: Metadata Management
readPDF: Read In a PDF Document
termFreq: Term Frequency Vector
stripWhitespace: Strip Whitespace from a Text Document
readXML: Read In an XML Document
removePunctuation: Remove Punctuation Marks from a Text Document
removeNumbers: Remove Numbers from a Text Document
stemDocument: Stem Words
removeSparseTerms: Remove Sparse Terms from a Term-Document Matrix
removeWords: Remove Words from a Text Document
stemCompletion: Complete Stems
stopwords: Stopwords
tm_term_score: Compute Score for Matching Terms
tokenizer: Tokenizers
writeCorpus: Write a Corpus to Disk
weightTfIdf: Weight by Term Frequency - Inverse Document Frequency
tm_filter: Filter and Index Functions on Corpora
tm_reduce: Combine Transformations
weightBin: Weight Binary
weightSMART: SMART Weightings
tm_map: Transformations on Corpora
weightTf: Weight by Term Frequency
DirSource: Directory Source
Docs: Access Document IDs and Terms
Reader: Readers
SimpleCorpus: Simple Corpora
Corpus: Corpora
DataframeSource: Data Frame Source
Source: Sources
TextDocument: Text Documents
PCorpus: Permanent Corpora
PlainTextDocument: Plain Text Documents
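
Several of the functions indexed above are typically used together. The following is a minimal sketch, not an official example from the package, of how a small corpus might be built, cleaned, and turned into a term-document matrix. The three sample sentences and the variable names (docs, corpus, tdm, frequent, tdm_tfidf) are assumptions made up for illustration; the functions themselves (VCorpus, VectorSource, tm_map, content_transformer, removePunctuation, removeWords, stopwords, stripWhitespace, TermDocumentMatrix, findFreqTerms, weightTfIdf, inspect) are the ones listed in the index.

```r
library(tm)

# Toy documents, invented for this sketch (not part of the tm data sets).
docs <- c(
  "Crude oil prices rose sharply on Monday.",
  "Oil producers agreed to cut crude output.",
  "The company announced an acquisition of a rival firm."
)

# Build a volatile (in-memory) corpus from a character vector.
corpus <- VCorpus(VectorSource(docs))

# Apply transformations with tm_map. Base R functions such as tolower
# are wrapped in content_transformer so the documents keep their class.
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Terms in rows, documents in columns, raw term frequencies as entries.
tdm <- TermDocumentMatrix(corpus)

# Terms that occur at least twice across the whole corpus.
frequent <- findFreqTerms(tdm, lowfreq = 2)
print(frequent)

# Re-weight the raw counts by term frequency - inverse document frequency.
tdm_tfidf <- weightTfIdf(tdm)
inspect(tdm_tfidf)
```

Lowercasing runs before removeWords in this sketch because the English stopword list is lowercase, so a capitalized "The" would otherwise survive the filter.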