Learn R Programming

⚠️There's a newer version (0.7-17) of this package.Take me there.

tm (version 0.5-8.3)

Text Mining Package

Description

A framework for text mining applications within R.

Copy Link

Version

Install

install.packages('tm')

Monthly Downloads

40,028

Version

0.5-8.3

License

GPL (>= 2)

Maintainer

Ingo Feinerer

Last Published

January 28th, 2013

Functions in tm (0.5-8.3)

Allow `tm' to Use a Cluster

List Available Tokenizers

TermDocumentMatrix

Term-Document Matrix

List Available Sources

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix

Uniform Resource Identifier Source

Reuters-21578 XML Source

Weight by Term Frequency

Read In a Text Document

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude

Read In a Reuters Corpus Volume 1 Document

Volatile Corpus

Full Text Search

SMART Weightings

getTransformations

List Available Transformations

FunctionGenerator

Function Generator

Intersection between Documents and Words

Read Document-Term Matrices

Remove Words from a Text Document

Read In an XML Document

Permanent Corpus Constructor

Inspect Objects

Visualize a Term-Document Matrix

DataframeSource

Data Frame Source

Combine Transformations

Weighting Function

Write a Corpus to Disk

Term Frequency Vector

Prescind Document Meta Data

Text Repository

Read In a MS Word Document

Compute a Tag Score

Explore Corpus Term Frequency Characteristics

Find Frequent Terms

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq

as.PlainTextDocument

Create Objects of Class PlainTextDocument

PlainTextDocument

Plain Text Document

Materialize Lazy Mappings

removePunctuation

Remove Punctuation Marks from a Text Document

Directory Source

Meta Data Management

Reuters21578Document

Reuters-21578 Text Document

Find Associations in a Term-Document Matrix

Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors

List Available Filters

Row, Column, Dim Names, Document IDs, and Terms

Access and Modify Text Documents

preprocessReut21578XML

Preprocess the Reuters-21578 XML archive.

removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix

Remove Numbers from a Text Document

Read In a Text Document

Weight by Term Frequency - Inverse Document Frequency

Read In a Gmane RSS Feed

Filter and Index Functions on Corpora

Read In a PDF Document

Transformations on Corpora

stripWhitespace

Strip Whitespace from a Text Document

RCV1 Text Document

List Available Readers

readReut21578XML

Read In a Reuters-21578 XML Document

Split a Corpus into Chunks

Statement Filter