Learn R Programming

⚠️There's a newer version (0.7-16) of this package.Take me there.

tm (version 0.5-3)

Text Mining Package

Description

A framework for text mining applications within R.

Copy Link

Version

Install

install.packages('tm')

Monthly Downloads

55,818

Version

0.5-3

License

GPL (>= 2)

Maintainer

Ingo Feinerer

Last Published

February 19th, 2010

Functions in tm (0.5-3)

plot

Visualize a Term-Document Matrix
getSources

List Available Sources
convert_UTF_8

Convert Encoding to UTF-8
makeChunks

Split a Corpus into Chunks
acq

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
DataframeSource

Data Frame Source
TextRepository

Text Repository
stemDocument

Stem Words
readRCV1

Read In a Reuters Corpus Volume 1 Document
Source

Access Sources
tm_combine

Combine Corpora, Documents, and Term-Document Matrices
PCorpus

Permanent Corpus Constructor
preprocessReut21578XML

Preprocess the Reuters-21578 XML archive.
weightBin

Weight Binary
crude

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
readReut21578XML

Read In a Reuters-21578 XML Document
getTransformations

List Available Transformations
removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix
inspect

Inspect Objects
GmaneSource

Gmane Source
searchFullText

Full Text Search
FunctionGenerator

Function Generator
sFilter

Statement Filter
tm_filter

Filter and Index Functions on Corpora
dissimilarity

Dissimilarity
removeWords

Remove Words from a Text Document
termFreq

Term Frequency Vector
weightTfIdf

Weight by Term Frequency - Inverse Document Frequency
findAssocs

Find Associations in a Term-Document Matrix
tm_intersect

Intersection between Documents and Words
weightTf

Weight by Term Frequency
number

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
findFreqTerms

Find Frequent Terms
readPDF

Read In a PDF Document
writeCorpus

Write a Corpus to Disk
readGmane

Read In a Gmane RSS Feed
tm_map

Transformations on Corpora
tm_reduce

Combine Transformations
WeightFunction

Weighting Function
VCorpus

Volatile Corpus
TextDocument

Access and Modify Text Documents
tm_cluster

Allow `tm' to Use a Cluster
prescindMeta

Prescind Document Meta Data
readDOC

Read In a MS Word Document
readPlain

Read In a Text Document
as.PlainTextDocument

Create Objects of Class PlainTextDocument
removeNumbers

Remove Numbers from a Text Document
getFilters

List Available Filters
stopwords

Multilingual Stopwords
removePunctuation

Remove Punctuation Marks from a Text Document
RCV1Document

RCV1 Text Document
Reuters21578Document

Reuters-21578 Text Document
materialize

Materialize Lazy Mappings
PlainTextDocument

Plain Text Document
readXML

Read In an XML Document
ReutersSource

Reuters-21578 XML Source
DirSource

Directory Source
VectorSource

Vector Source
names

Row, Column, Dim Names, Document IDs, and Terms
meta

Meta Data Management
stemCompletion

Complete Stems
XMLSource

XML Source
Dictionary

Dictionary
URISource

Uniform Resource Identifier Source
getReaders

List Available Readers
TermDocumentMatrix

Term-Document Matrix
readTabular

Read In a Text Document
stripWhitespace

Strip Whitespace from a Text Document