⚠️ There's a newer version (0.7-14) of this package.

tm (version 0.3-1)

Text Mining Package

Description

A framework for text mining applications within R.

Install

install.packages('tm')

Monthly Downloads

53,395

Version

0.3-1

License

GPL-2

Maintainer

Last Published

August 16th, 2024

Functions in tm (0.3-1)

Corpus-class

Corpus
dissimilarity-methods

Methods for Function dissimilarity in Package `tm'
appendMeta-methods

Methods for Function appendMeta in Package `tm'
WeightFunction

Weighting Function Constructor
TextRepository-class

Text Repository
TextRepository

Text Repository
Dictionary-class

Dictionary
convertReut21578XMLPlain

Transform a Reuters21578 XML Document to a Plain Text Document
readDOC

Read In a MS Word Document
TermDocMatrix

Term-Document Matrix
acq

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
colnames-methods

Methods for Function colnames in Package `tm'
inspect-methods

Methods for Function inspect in Package `tm'
sFilter

Statement Filter
prescindMeta-methods

Methods for Function prescindMeta in Package `tm'
tmFilter-methods

Methods for Function tmFilter in Package `tm'
getSources

Get Available Sources
Corpus

Corpus
findAssocs-methods

Methods for Function findAssocs in Package `tm'
getReaders

Get Available Readers
materialize

Materialize Lazy Mappings
VectorSource

Vector Source
GmaneSource

Gmane Source
removeMultipart-methods

Methods for Function removeMultipart in Package `tm'
termFreq

Term Frequency Vector
RCV1Document-class

RCV1 Text Document
getTransformations

Get Available Transformations
plot

Visualize a Term-Document Matrix
readRCV1

Read In a Reuters Corpus Volume 1 Document
convertMboxEml

Convert E-Mails From mbox Format To eml Format
readGmane

Read In A Newsgroup Document
appendElem-methods

Methods for Function appendElem in Package `tm'
XMLTextDocument-class

Text document
FunctionGenerator

Function Generator Constructor
MetaDataNode-class

Metadata Node
createDictionary-methods

Methods for Function createDictionary in Package `tm'
readPDF

Read In a PDF Document
removeMeta-methods

Methods for Function removeMeta in Package `tm'
removePunctuation-methods

Methods for Function removePunctuation in Package `tm'
ncol-methods

Methods for Function ncol in Package `tm'
tm-internal

Internal tm Functions
crude

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
weightTfIdf

Weight By Term Frequency Inverse Document Frequency
DublinCore-methods

Methods for Function DublinCore in Package `tm'
tmUpdate-methods

Methods for Function tmUpdate in Package `tm'
DirSource-class

Source for Directories
asPlain-methods

Methods for Function asPlain in Package `tm'
CSVSource

Comma Separated Value Source
tmIntersect-methods

Methods for Function tmIntersect in Package `tm'
PlainTextDocument-class

Plain Text Document
weightBin

Weight Binary
CSVSource-class

Source for Comma Separated Files
NewsgroupDocument-class

Newsgroup Text Document
length-methods

Methods for Function length in Package `tm'
GmaneSource-class

Source for Gmane Feeds
dim-methods

Methods for Function dim in Package `tm'
readNewsgroup

Read In a Newsgroup Document
Source-class

Source Manager
removeCitation-methods

Methods for Function removeCitation in Package `tm'
weightTf

Weight By Term Frequency
WeightFunction-class

Weighting Function
stemCompletion

Complete Stems
eoi-methods

Methods for Function eoi in Package `tm'
loadDoc-methods

Methods for Function loadDoc in Package `tm'
meta-methods

Methods for Function meta in Package `tm'
readPlain

Read In a Text Document
preprocessReut21578XML

Preprocess the Reuters21578 XML Archive
readHTML

Read In a Simple HTML Document
removeSignature-methods

Methods for Function removeSignature in Package `tm'
tmTolower-methods

Methods for Function tmTolower in Package `tm'
getElem-methods

Methods for Function getElem in Package `tm'
show-methods

Methods for Function show in Package `tm'
stopwords

Multilingual Stopwords
DirSource

Directory Source
removeNumbers-methods

Methods for Function removeNumbers in Package `tm'
Dictionary

Dictionary
ReutersSource-class

Source for Reuters Files
replacePatterns-methods

Methods for Function replacePatterns in Package `tm'
tmIndex-methods

Methods for Function tmIndex in Package `tm'
stemDoc-methods

Methods for Function stemDoc in Package `tm'
weightLogical

Weight Logical
ReutersSource

Reuters Source
stepNext-methods

Methods for Function stepNext in Package `tm'
getFilters

Get Available Filters
makeChunks

Split a Corpus into Chunks
stripWhitespace-methods

Methods for Function stripWhitespace in Package `tm'
findFreqTerms-methods

Methods for Function findFreqTerms in Package `tm'
[-methods

Methods for Subset Functions in Package `tm'
convertRCV1Plain

Transform an RCV1 Document to a Plain Text Document
VectorSource-class

Source for Vectors
readReut21578XML

Read In a Reuters21578 XML Document
rownames-methods

Methods for Function rownames in Package `tm'
removeWords-methods

Methods for Function removeWords in Package `tm'
dimnames-methods

Methods for Function dimnames in Package `tm'
searchFullText-methods

Methods for Function searchFullText in Package `tm'
TermDocMatrix-class

Term-Document Matrix
Reuters21578Document-class

Reuters21578 Text Document
StructuredTextDocument-class

Structured Text Document
summary-methods

Methods for Function summary in Package `tm'
tmMap-methods

Methods for Function tmMap in Package `tm'
%IN%-methods

Methods for Function %IN% in Package `tm'
TextDocument-class

Text Document
nrow-methods

Methods for Function nrow in Package `tm'
FunctionGenerator-class

Function Generator
c-methods

Methods for Function c in Package `tm'
writeCorpus-methods

Methods for Function writeCorpus in Package `tm'
removeSparseTerms-methods

Methods for Function removeSparseTerms in Package `tm'
DataframeSource

Data Frame Source
deactivateCluster

Disallow `tm' to Use a Cluster
activateCluster

Allow `tm' to Use a Cluster If Available
inspect

Inspect Objects
dissimilarity

Dissimilarity
URISource-class

Source for Uniform Resource Identifiers
XMLSource-class

Source for XML Files
DataframeSource-class

Source for Data Frames
readTabular

Read In a Text Document
findFreqTerms

Find Frequent Terms
replaceWords-methods

Methods for Function replaceWords in Package `tm'
XMLSource

XML Source
findAssocs

Find Associations in a Term-Document Matrix
readXML

Read In an XML Document
number

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
URISource

Uniform Resource Identifier Source
names

Row, Column, Dim Names, Document IDs, and Terms
pGetElem-methods

Methods for Function pGetElem in Package `tm'
tmReduce

Combine Transformations
TermDocumentMatrix

Term-Document Matrix
removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix