Note: a newer version (0.7-14) of this package is available.

tm (version 0.4)

Text Mining Package

Description

A framework for text mining applications within R.

Install

install.packages('tm')

Monthly Downloads

53,395

Version

0.4

License

GPL-2

Last Published

July 1st, 2009

Functions in tm (0.4)

DublinCore-methods

Methods for Function DublinCore in Package `tm'
asPlain-methods

Methods for Function asPlain in Package `tm'
%IN%-methods

Methods for Function %IN% in Package `tm'
stemDoc-methods

Methods for Function stemDoc in Package `tm'
convertRCV1Plain

Transform a RCV1 Document to a Plain Text Document
VCorpus-class

Volatile Corpus
Corpus-class

Corpus
DataframeSource

Data Frame Source
prescindMeta

Prescind Metadata
URISource

Uniform Resource Identifier Source
XMLTextDocument-class

Text Document
PCorpus-class

Permanent Corpus
URISource-class

Source for URIs
removeSignature-methods

Methods for Function removeSignature in Package `tm'
GmaneSource-class

Source for Gmane Feeds
makeChunks

Split a Corpus into Chunks
readPDF

Read In a PDF Document
PCorpus

Permanent Corpus Constructor
readGmane

Read In a Newsgroup Document
readDOC

Read In a MS Word Document
WeightFunction-class

Weighting Function
c-methods

Methods for Function c in Package `tm'
XMLSource

XML Source
TextDocument-class

Text Document
DirSource

Directory Source
activateCluster

Allow `tm' to Use a Cluster If Available
getFilters

Get Available Filters
crude

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
StructuredTextDocument-class

Structured Text Document
readTabular

Read In a Text Document
FunctionGenerator-class

Function Generator
appendElem-methods

Methods for Function appendElem in Package `tm'
tmIndex-methods

Methods for Function tmIndex in Package `tm'
removeWords-methods

Methods for Function removeWords in Package `tm'
findAssocs

Find Associations in a Term-Document Matrix
RCV1Document-class

RCV1 Text Document
getElem-methods

Methods for Function getElem in Package `tm'
VectorSource

Vector Source
tmFilter-methods

Methods for Function tmFilter in Package `tm'
FunctionGenerator

Function Generator Constructor
VectorSource-class

Source for Vectors
readRCV1

Read In a Reuters Corpus Volume 1 Document
getReaders

Get Available Readers
eoi-methods

Methods for Function eoi in Package `tm'
number

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
weightTf

Weight By Term Frequency
GmaneSource

Gmane Source
TextRepository

Text Repository
WeightFunction

Weighting Function Constructor
readXML

Read In an XML Document
ReutersSource

Reuters Source
tmIntersect-methods

Methods for Function tmIntersect in Package `tm'
pGetElem-methods

Methods for Function pGetElem in Package `tm'
DataframeSource-class

Source for Data Frames
readPlain

Read In a Text Document
tmMap-methods

Methods for Function tmMap in Package `tm'
dissimilarity

Dissimilarity
Dictionary-class

Dictionary
materialize

Materialize Lazy Mappings
getTransformations

Get Available Transformations
DirSource-class

Source for Directories
MetaDataNode-class

Metadata Node
NewsgroupDocument-class

Newsgroup Text Document
show-methods

Methods for Function show in Package `tm'
TextRepository-class

Text Repository
deactivateCluster

Disallow `tm' to Use a Cluster
replacePatterns-methods

Methods for Function replacePatterns in Package `tm'
removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix
stemCompletion

Complete Stems
plot

Visualize a Term-Document Matrix
convertMboxEml

Convert E-Mails From mbox Format To eml Format
Source-class

Source
stopwords

Multilingual Stopwords
getSources

Get Available Sources
meta-methods

Methods for Function meta in Package `tm'
removeMultipart-methods

Methods for Function removeMultipart in Package `tm'
preprocessReut21578XML

Preprocess the Reuters21578 XML Archive
names

Row, Column, Dim Names, Document IDs, and Terms
stepNext-methods

Methods for Function stepNext in Package `tm'
Dictionary

Dictionary
TermDocumentMatrix

Term-Document Matrix
convertReut21578XMLPlain

Transform a Reuters21578 XML Document to a Plain Text Document
XMLSource-class

Source for XML Files
readHTML

Read In a Simple HTML Document
removePunctuation-methods

Methods for Function removePunctuation in Package `tm'
weightTfIdf

Weight By Term Frequency Inverse Document Frequency
findFreqTerms

Find Frequent Terms
sFilter

Statement Filter
acq

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
[-methods

Methods for Subset Functions in Package `tm'
tmTolower-methods

Methods for Function tmTolower in Package `tm'
tmReduce

Combine Transformations
removeCitation-methods

Methods for Function removeCitation in Package `tm'
removeNumbers-methods

Methods for Function removeNumbers in Package `tm'
Reuters21578Document-class

Reuters21578 Text Document
readNewsgroup

Read In a Newsgroup Document
summary-methods

Methods for Function summary in Package `tm'
weightBin

Weight Binary
termFreq

Term Frequency Vector
searchFullText-methods

Methods for Function searchFullText in Package `tm'
PlainTextDocument-class

Plain Text Document
VCorpus

Volatile Corpus Constructor
stripWhitespace-methods

Methods for Function stripWhitespace in Package `tm'
writeCorpus-methods

Methods for Function writeCorpus in Package `tm'
length-methods

Methods for Function length in Package `tm'
readReut21578XML

Read In a Reuters21578 XML Document
inspect

Inspect Objects