Learn R Programming

polmineR (version 0.8.9)

dispersion: Dispersion of a query or multiple queries.

Description

The method returns a data.table with the number of matches of a query or multiple queries (optionally frequencies) in a corpus or subcorpus as partitioned by one or two s-attributes.

Usage

dispersion(.Object, ...)

# S4 method for slice dispersion( .Object, query, s_attribute, cqp = FALSE, p_attribute = getOption("polmineR.p_attribute"), freq = FALSE, fill = TRUE, mc = FALSE, progress = FALSE, verbose = FALSE, ... )

# S4 method for partition dispersion( .Object, query, s_attribute, cqp = FALSE, p_attribute = getOption("polmineR.p_attribute"), freq = FALSE, fill = TRUE, mc = FALSE, progress = TRUE, verbose = FALSE, ... )

# S4 method for subcorpus dispersion( .Object, query, s_attribute, cqp = FALSE, p_attribute = getOption("polmineR.p_attribute"), freq = FALSE, fill = FALSE, mc = FALSE, progress = TRUE, verbose = FALSE, ... )

# S4 method for corpus dispersion( .Object, query, s_attribute, cqp = is.cqp, p_attribute = getOption("polmineR.p_attribute"), freq = FALSE, fill = TRUE, mc = FALSE, progress = FALSE, verbose = FALSE, ... )

# S4 method for character dispersion( .Object, query, s_attribute, cqp = is.cqp, p_attribute = getOption("polmineR.p_attribute"), freq = FALSE, fill = TRUE, mc = FALSE, progress = TRUE, verbose = TRUE, ... )

# S4 method for hits dispersion( .Object, source, s_attribute, freq = FALSE, fill = TRUE, verbose = TRUE, ... )

# S4 method for remote_corpus dispersion(.Object, ...)

# S4 method for remote_subcorpus dispersion(.Object, ...)

Value

A data.table.

A data.table.

Arguments

.Object

A corpus, subcorpus or partition object or a corpus provided by a character string.

...

Further parameters.

query

A character vector stating one or multiple queries.

s_attribute

A character vector (length 1 or 2) providing s-attributes.

cqp

If logical, whether the query is a CQP query, if it is a function that is passed in, the function will be applied to the query to guess whether query is a CQP query

p_attribute

Length one character vector, the p-attribute that will be looked up (typically 'word' or 'lemma').

freq

A logical value, whether to calculate normalized frequencies.

fill

A logical value, whether to report zero matches. Defaults to TRUE. But note that if there are few matches and many values of the s-attribute(s), the resulting data structure is sparse and potentially bloated.

mc

A logical value, whether to use multicore.

progress

A logical value, whether to show progress.

verbose

A logical value, whether to be verbose.

source

The source of the evaluation the hits reported in .Object are based on, a corpus, subcorpus or partition object.

Author

Andreas Blaette

Details

Augmenting the data.table with zeros for subcorpora that do not yield query matches (argument fill = TRUE) may require adding many new columns. A respective warning issued by the data.table package is supplemented an additional explanatory note of the polmineR package.

See Also

The worker behind the dispersion-method is the hits-method.

count

Examples

Run this code
use("polmineR")
dispersion("GERMAPARLMINI", query = "Integration", s_attribute = "date")

test <- partition("GERMAPARLMINI", date = ".*", p_attribute = NULL, regex = TRUE)
integration <- dispersion(
  test, query = "Integration",
  p_attribute = "word", s_attribute = "date"
)
integration <- dispersion(test, "Integration", s_attribute = c("date", "party"))
integration <- dispersion(test, '"Integration.*"', s_attribute = "date", cqp = TRUE)

Run the code above in your browser using DataLab