Learn R Programming

⚠️There's a newer version (0.8.9) of this package.Take me there.

polmineR (version 0.8.3)

Verbs and Nouns for Corpus Analysis

Description

Package for corpus analysis using the Corpus Workbench ('CWB', ) as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora and to carry out basic statistical operations (count, co-occurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document-term matrices, term-co-occurrence matrices etc.) can be created based on the indexed corpora.

Copy Link

Version

Install

install.packages('polmineR')

Monthly Downloads

474

Version

0.8.3

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Andreas Blaette

Last Published

July 1st, 2020

Functions in polmineR (0.8.3)

annotations

Annotation functionality
as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.
Cooccurrences-class

Cooccurrences class for corpus/partition.
bundle-class

Bundle Class
as.speeches

Split corpus or partition into speeches.
as.VCorpus

Get VCorpus.
blapply

apply a function over a list or bundle
Cooccurrences,corpus-method

Get all cooccurrences in corpus/partition.
as.markdown

Get markdown-formatted full text of a partition.
as.sparseMatrix

Type conversion - get sparseMatrix.
count

Get counts.
context_bundle-class

S4 context_bundle class
enrich

Enrich an object.
encodings

Conversion between corpus and native encoding.
context

Analyze context of a node word.
corpus-class

Corpus class initialization
decode

Decode corpus or subcorpus.
dispersion

Dispersion of a query or multiple queries.
corpus-methods

Corpus class methods
cooccurrences-class

Cooccurrences class.
cooccurrences

Get cooccurrence statistics.
features-class

Feature selection by comparison.
context-class

Context class.
dotplot

dotplot
chisquare

Perform chisquare-text.
encoding

Get and set encoding.
count_class

Count class.
cqp

Tools for CQP queries.
cpos

Get corpus positions for a query or queries.
highlight

Highlight tokens in text output.
hits

Get hits for query
ll

Compute Log-likelihood Statistics.
partition_bundle

Generate bundle of partitions.
kwic-class

S4 kwic class
kwic

Perform keyword-in-context (KWIC) analysis.
partition_bundle-class

Bundle of partitions (partition_bundle class).
partition_class

Partition class and methods.
partition_to_string

Decode as String.
registry_move

Get registry and data directories.
registry_get_name

Evaluate registry file.
p_attributes

Get p-attributes.
means

calculate means
partition

Initialize a partition.
ocpu_exec

Execute code on OpenCPU server
phrases

Manage and use phrases
noise

detect noise
pmi

Calculate Pointwise Mutual Information (PMI).
subset

Subsetting corpora and subcorpora
hits_class

Hits class.
html

Generate html from object.
features

Get features by comparison.
size

Get Number of Tokens.
reexports

Objects exported from other packages
polmineR-package

polmineR-package
subcorpus

The S4 subcorpus class.
read

Display full text.
trim

trim an object
regions

Regions of a CWB corpus.
slice

Virtual class slice.
use

Add corpora in R data packages to session registry.
subcorpus_bundle-class

Bundled subcorpora
renamed

Renamed Functions
terms

Get terms in partition or corpus.
registry_reset

Reset registry directory.
get_template

Get template for reconstructing full text.
polmineR-defunct

Defunct functionality
get_type

Get corpus/partition type.
ngrams_class

Ngrams class.
ngrams

Get N-Grams
get_token_stream

Get Token Stream.
polmineR-generics

Generic methods defined in the polmineR package
t_test

Perform t-test.
view

Inspect object using View().
restore

Restore S4 object with data.table slots
weigh

Apply Weight to Matrix
s_attributes

Get s-attributes.
tooltips

Add tooltips to text output.
textstat-class

S4 textstat superclass.