A newer version (0.7-14) of this package is available.

tm (version 0.7-7)

Text Mining Package

Description

A framework for text mining applications within R.

Install

install.packages('tm')

Monthly Downloads

53,395

Version

0.7-7

License

GPL-3

Maintainer

Last Published

December 12th, 2019

Functions in tm (0.7-7)

Source: Sources
PCorpus: Permanent Corpora
TextDocument: Text Documents
Zipf_n_Heaps: Explore Corpus Term Frequency Characteristics
VectorSource: Vector Source
readDOC: Read In a MS Word Document
WeightFunction: Weighting Function
inspect: Inspect Objects
plot: Visualize a Term-Document Matrix
hpc: Parallelized ‘lapply’
Reader: Readers
PlainTextDocument: Plain Text Documents
tm_filter: Filter and Index Functions on Corpora
SimpleCorpus: Simple Corpora
crude: 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
Corpus: Corpora
weightSMART: SMART Weightings
Docs: Access Document IDs and Terms
ZipSource: ZIP File Source
DirSource: Directory Source
weightTfIdf: Weight by Term Frequency - Inverse Document Frequency
stripWhitespace: Strip Whitespace from a Text Document
VCorpus: Volatile Corpora
DataframeSource: Data Frame Source
findAssocs: Find Associations in a Term-Document Matrix
readDataframe: Read In a Text Document from a Data Frame
findFreqTerms: Find Frequent Terms
getTokenizers: Tokenizers
URISource: Uniform Resource Identifier Source
getTransformations: Transformations
termFreq: Term Frequency Vector
weightTf: Weight by Term Frequency
readXML: Read In an XML Document
tokenizer: Tokenizers
meta: Metadata Management
TermDocumentMatrix: Term-Document Matrix
stemDocument: Stem Words
tm_combine: Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
acq: 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
removePunctuation: Remove Punctuation Marks from a Text Document
content_transformer: Content Transformers
readPlain: Read In a Text Document
readPDF: Read In a PDF Document
weightBin: Weight Binary
readReut21578XML: Read In a Reuters-21578 XML Document
removeNumbers: Remove Numbers from a Text Document
stopwords: Stopwords
writeCorpus: Write a Corpus to Disk
findMostFreqTerms: Find Most Frequent Terms
tm_map: Transformations on Corpora
readTagged: Read In a POS-Tagged Word Text Document
removeSparseTerms: Remove Sparse Terms from a Term-Document Matrix
XMLSource: XML Source
removeWords: Remove Words from a Text Document
XMLTextDocument: XML Text Documents
tm_term_score: Compute Score for Matching Terms
foreign: Read Document-Term Matrices
readRCV1: Read In a Reuters Corpus Volume 1 Document
stemCompletion: Complete Stems
tm_reduce: Combine Transformations
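
Several of the functions listed above are commonly chained into a short pipeline. The following is a minimal sketch of one such chain, not an excerpt from the package documentation. The three sample sentences and the variable names (texts, corpus, tdm) are hypothetical and chosen only for illustration. It assumes tm is already installed.

```r
# Minimal sketch; assumes tm is installed via install.packages('tm').
library(tm)

# Hypothetical toy documents, for illustration only.
texts <- c("Oil prices rose sharply today.",
           "Crude oil output fell; prices rose again.",
           "The merger was announced today.")

# Build a volatile (in-memory) corpus from a character vector.
corpus <- VCorpus(VectorSource(texts))

# Apply transformations. Base functions such as tolower are wrapped in
# content_transformer() so each document keeps its class and metadata.
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Term-document matrix: rows are terms, columns are documents.
tdm <- TermDocumentMatrix(corpus)

# Print a summary of the matrix and list terms occurring at least twice.
inspect(tdm)
findFreqTerms(tdm, lowfreq = 2)
```

VCorpus keeps everything in memory, which suits small experiments like this one. PCorpus and SimpleCorpus, also listed above, make different storage trade-offs.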