There's a newer version (0.7-14) of this package.
tm (version 0.7-7)
Text Mining Package
Description
A framework for text mining applications within R.
Versions
0.7-14, 0.7-12, 0.7-11, 0.7-10, 0.7-9, 0.7-8, 0.7-7, 0.7-6, 0.7-5, 0.7-4, 0.7-3, 0.7-2, 0.7-1, 0.6-2, 0.6-1, 0.5-10, 0.5-9.1, 0.5-8.3, 0.5-8.1, 0.5-7.1, 0.5-6, 0.5-5, 0.5-4.1, 0.5-3, 0.5-2, 0.5-1, 0.4, 0.3-4.1, 0.3-3, 0.3-2, 0.3-1, 0.2-3.7, 0.2-1, 0.1-1
Install
install.packages('tm')
Monthly Downloads: 53,395
Version: 0.7-7
License: GPL-3
Maintainer: Ingo Feinerer
Last Published: December 12th, 2019
Functions in tm (0.7-7)
Source: Sources
PCorpus: Permanent Corpora
TextDocument: Text Documents
Zipf_n_Heaps: Explore Corpus Term Frequency Characteristics
VectorSource: Vector Source
readDOC: Read In a MS Word Document
WeightFunction: Weighting Function
inspect: Inspect Objects
plot: Visualize a Term-Document Matrix
hpc: Parallelized ‘lapply’
Reader: Readers
PlainTextDocument: Plain Text Documents
tm_filter: Filter and Index Functions on Corpora
SimpleCorpus: Simple Corpora
crude: 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
Corpus: Corpora
weightSMART: SMART Weightings
Docs: Access Document IDs and Terms
ZipSource: ZIP File Source
DirSource: Directory Source
weightTfIdf: Weight by Term Frequency - Inverse Document Frequency
stripWhitespace: Strip Whitespace from a Text Document
VCorpus: Volatile Corpora
DataframeSource: Data Frame Source
findAssocs: Find Associations in a Term-Document Matrix
readDataframe: Read In a Text Document from a Data Frame
findFreqTerms: Find Frequent Terms
getTokenizers: Tokenizers
URISource: Uniform Resource Identifier Source
getTransformations: Transformations
termFreq: Term Frequency Vector
weightTf: Weight by Term Frequency
readXML: Read In an XML Document
tokenizer: Tokenizers
meta: Metadata Management
TermDocumentMatrix: Term-Document Matrix
stemDocument: Stem Words
tm_combine: Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
acq: 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
removePunctuation: Remove Punctuation Marks from a Text Document
content_transformer: Content Transformers
readPlain: Read In a Text Document
readPDF: Read In a PDF Document
weightBin: Weight Binary
readReut21578XML: Read In a Reuters-21578 XML Document
removeNumbers: Remove Numbers from a Text Document
stopwords: Stopwords
writeCorpus: Write a Corpus to Disk
findMostFreqTerms: Find Most Frequent Terms
tm_map: Transformations on Corpora
readTagged: Read In a POS-Tagged Word Text Document
removeSparseTerms: Remove Sparse Terms from a Term-Document Matrix
XMLSource: XML Source
removeWords: Remove Words from a Text Document
XMLTextDocument: XML Text Documents
tm_term_score: Compute Score for Matching Terms
foreign: Read Document-Term Matrices
readRCV1: Read In a Reuters Corpus Volume 1 Document
stemCompletion: Complete Stems
tm_reduce: Combine Transformations