Rdocumentation
powered by
Learn R Programming
⚠️
There's a newer version (0.7-16) of this package.
Take me there.
tm (version 0.5-3)
Text Mining Package
Description
A framework for text mining applications within R.
Copy Link
Link to current version
Version
Version
0.7-16
0.7-15
0.7-14
0.7-12
0.7-11
0.7-10
0.7-9
0.7-8
0.7-7
0.7-6
0.7-5
0.7-4
0.7-3
0.7-2
0.7-1
0.6-2
0.6-1
0.5-10
0.5-9.1
0.5-8.3
0.5-8.1
0.5-7.1
0.5-6
0.5-5
0.5-4.1
0.5-3
0.5-2
0.5-1
0.4
0.3-4.1
0.3-3
0.3-2
0.3-1
0.2-3.7
0.2-1
0.1-1
Install
install.packages('tm')
Monthly Downloads
55,818
Version
0.5-3
License
GPL (>= 2)
Maintainer
Ingo Feinerer
Last Published
February 19th, 2010
Functions in tm (0.5-3)
Search all functions
plot
Visualize a Term-Document Matrix
getSources
List Available Sources
convert_UTF_8
Convert Encoding to UTF-8
makeChunks
Split a Corpus into Chunks
acq
50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
DataframeSource
Data Frame Source
TextRepository
Text Repository
stemDocument
Stem Words
readRCV1
Read In a Reuters Corpus Volume 1 Document
Source
Access Sources
tm_combine
Combine Corpora, Documents, and Term-Document Matrices
PCorpus
Permanent Corpus Constructor
preprocessReut21578XML
Preprocess the Reuters-21578 XML archive.
weightBin
Weight Binary
crude
20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
readReut21578XML
Read In a Reuters-21578 XML Document
getTransformations
List Available Transformations
removeSparseTerms
Remove Sparse Terms from a Term-Document Matrix
inspect
Inspect Objects
GmaneSource
Gmane Source
searchFullText
Full Text Search
FunctionGenerator
Function Generator
sFilter
Statement Filter
tm_filter
Filter and Index Functions on Corpora
dissimilarity
Dissimilarity
removeWords
Remove Words from a Text Document
termFreq
Term Frequency Vector
weightTfIdf
Weight by Term Frequency - Inverse Document Frequency
findAssocs
Find Associations in a Term-Document Matrix
tm_intersect
Intersection between Documents and Words
weightTf
Weight by Term Frequency
number
The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
findFreqTerms
Find Frequent Terms
readPDF
Read In a PDF Document
writeCorpus
Write a Corpus to Disk
readGmane
Read In a Gmane RSS Feed
tm_map
Transformations on Corpora
tm_reduce
Combine Transformations
WeightFunction
Weighting Function
VCorpus
Volatile Corpus
TextDocument
Access and Modify Text Documents
tm_cluster
Allow `tm' to Use a Cluster
prescindMeta
Prescind Document Meta Data
readDOC
Read In a MS Word Document
readPlain
Read In a Text Document
as.PlainTextDocument
Create Objects of Class PlainTextDocument
removeNumbers
Remove Numbers from a Text Document
getFilters
List Available Filters
stopwords
Multilingual Stopwords
removePunctuation
Remove Punctuation Marks from a Text Document
RCV1Document
RCV1 Text Document
Reuters21578Document
Reuters-21578 Text Document
materialize
Materialize Lazy Mappings
PlainTextDocument
Plain Text Document
readXML
Read In an XML Document
ReutersSource
Reuters-21578 XML Source
DirSource
Directory Source
VectorSource
Vector Source
names
Row, Column, Dim Names, Document IDs, and Terms
meta
Meta Data Management
stemCompletion
Complete Stems
XMLSource
XML Source
Dictionary
Dictionary
URISource
Uniform Resource Identifier Source
getReaders
List Available Readers
TermDocumentMatrix
Term-Document Matrix
readTabular
Read In a Text Document
stripWhitespace
Strip Whitespace from a Text Document