A newer version (0.8.9) of this package is available.

polmineR (version 0.8.8)

Verbs and Nouns for Corpus Analysis

Description

Package for corpus analysis using the Corpus Workbench ('CWB') as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora and to carry out basic statistical operations (counts, co-occurrences, etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. The corresponding data structures (document-term matrices, term-co-occurrence matrices, etc.) can be created from the indexed corpora.
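As a rough illustration of the workflow described above, the following sketch counts a term, shows it in context, and computes co-occurrence statistics. It uses only functions named in the index below (use, count, kwic, cooccurrences). The corpus name "REUTERS" and the query term "oil" are assumptions not stated on this page: the sketch assumes a small sample corpus of that name ships with the package. The call is guarded so that nothing runs if polmineR is not installed.

```r
# Minimal sketch, not an authoritative usage guide.
# Assumption: polmineR ships a sample corpus called "REUTERS".
query <- "oil"  # hypothetical search term chosen for illustration

if (requireNamespace("polmineR", quietly = TRUE)) {
  library(polmineR)

  # Add the corpora bundled in the package to the session registry.
  use("polmineR")

  # Basic statistical operations on the indexed corpus.
  hits_n  <- count("REUTERS", query = query)          # frequency of the query
  conc    <- kwic("REUTERS", query = query)           # keyword-in-context lines
  cooc    <- cooccurrences("REUTERS", query = query)  # co-occurrence statistics

  print(hits_n)
}
```

The intermediate objects (hits_n, conc, cooc) are kept separate so that each can be inspected on its own; which corpora are actually available depends on the local installation.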

Install

install.packages('polmineR')

Monthly Downloads

474

Version

0.8.8

License

GPL-3

Maintainer

Andreas Blaette

Last Published

March 29th, 2023

Functions in polmineR (0.8.8)

as.markdown

Get markdown-formatted full text of a partition.
as.VCorpus

Get VCorpus.
blapply

Apply a function over a list or bundle.
as.speeches

Split corpus or partition into speeches.
as.sparseMatrix

Type conversion - get sparseMatrix.
as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.
annotations

Annotation functionality
bundle-class

Bundle Class
Cooccurrences-class

Cooccurrences class for corpus/partition.
Cooccurrences,corpus-method

Get all cooccurrences in corpus/partition.
context-class

Context class.
cooccurrences

Get cooccurrence statistics.
context_bundle-class

S4 context_bundle class
capitalize

Capitalize character vector.
corpus-methods

Corpus class methods
context

Analyze context of a node word.
chisquare

Perform chi-square test.
corpus-class

Corpus class initialization
count

Get counts.
cooccurrences-class

Cooccurrences class.
count_class

Count class.
decode

Decode corpus or subcorpus.
encodings

Conversion between corpus and native encoding.
dispersion

Dispersion of a query or multiple queries.
enrich

Enrich an object.
features-class

Feature selection by comparison.
dotplot

dotplot
encoding

Get and set encoding.
cpos

Get corpus positions for a query or queries.
cqp

Tools for CQP queries.
features

Get features by comparison.
get_token_stream

Get Token Stream.
get_type

Get corpus/partition type.
kwic

Perform keyword-in-context (KWIC) analysis.
kwic-class

S4 kwic class
hits

Get hits for query.
hits_class

S4 class to represent hits for queries.
highlight

Highlight tokens in text output.
ngrams

Get N-Grams
ngrams_class

Ngrams class.
html

Generate html from object.
partition_bundle-class

Bundle of partitions (partition_bundle class).
p_attributes

Get p-attributes.
is_nested

Check whether s-attributes of corpus are nested.
partition_bundle

Generate bundle of partitions.
ocpu_exec

Execute code on OpenCPU server.
noise

Detect noise.
partition

Initialize a partition.
pmi

Calculate Pointwise Mutual Information (PMI).
phrases

Manage and use phrases
means

Calculate means.
partition_class

Partition class and methods.
polmineR-defunct

Defunct functionality
ll

Compute Log-likelihood Statistics.
partition_to_string

Decode as String.
read

Display full text.
reexports

Objects exported from other packages
polmineR-generics

Generic methods defined in the polmineR package
regions

Regions of a CWB corpus.
ranges-class

Ranges of query matches.
renamed

Renamed Functions
subcorpus

The S4 subcorpus class.
registry_reset

Reset registry directory.
slice

Virtual class slice.
registry_get_name

Evaluate registry file.
registry_move

Get registry and data directories.
textstat-class

S4 textstat superclass.
tooltips

Add tooltips to text output.
terms

Get terms in partition or corpus.
trim

Trim an object.
view

Inspect object using View().
use

Add corpora in R data packages to session registry.
ranges

Get ranges for query.
polmineR-package

polmineR-package
subcorpus_bundle-class

Bundled subcorpora
subset

Subsetting corpora and subcorpora
size

Get Number of Tokens.
s_attributes

Get s-attributes.
weigh

Apply Weight to Matrix
t_test

Perform t-test.
get_template

Get template for reconstructing full text.