Learn R Programming

⚠️There's a newer version (0.10.2) of this package.Take me there.

corpus (version 0.8.0)

Text Corpus Analysis

Description

Text corpus data analysis, with full support for Unicode. Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies (including n-grams).

Copy Link

Version

Install

install.packages('corpus')

Monthly Downloads

129

Version

0.8.0

License

Apache License (== 2.0) | file LICENSE

Issues

Pull Requests

Stars

Forks

Repository

https://github.com/patperry/r-corpus

Maintainer

Patrick Perry

Last Published

July 19th, 2017

Functions in corpus (0.8.0)

Term Frequency Tabulation

UTF-8 Text Handling

The Federalist Papers

JSON Data Input

Text Tokenization

Text Type Sets.

Term Frequencies

corpus-deprecated

Deprecated Functions in Package corpus

The Corpus Package

Searching for terms in text.

Segmenting Text