bundle-class: Bundle Class

Description

A bundle is used to combine several objects (partition, context, features, cooccurrences objects) into one S4 class object. Typically, a class inheriting from the bundle superclass will be used. When working with a context_bundle, a features_bundle, a cooccurrences_bundle, or a context_bundle, a similar set of standard methods is available to perform transformations.

Usage

# S4 method for bundle
name(x) <- value
# S4 method for bundle
length(x)
# S4 method for bundle
names(x)
# S4 method for bundle,vector
names(x) <- value
# S4 method for bundle
unique(x)
# S4 method for bundle,bundle
+(e1, e2)
# S4 method for bundle,textstat
+(e1, e2)
# S4 method for bundle
[[(x, i)
# S4 method for bundle,ANY,ANY,ANY
[(x, i)
# S4 method for bundle
[[(x, i) <- value
# S4 method for bundle
$(x, name)
# S4 method for bundle
$(x, name) <- value
# S4 method for bundle
sample(x, size)
# S4 method for list
as.bundle(object, ...)
# S4 method for textstat
as.bundle(object)
# S3 method for bundle
as.data.table(x, keep.rownames, col, ...)
# S4 method for bundle
as.matrix(x, col)
# S4 method for bundle
subset(x, ...)
# S4 method for bundle
as.list(x)
# S3 method for bundle
as.list(x, ...)
# S4 method for bundle
get_corpus(x)

Arguments

x: a bundle object
value: character string with a name to be assigned
e1: object 1
e2: object 2
i: integer or character values for indexing a bundle object.
name: The name of an object in the bundle object.
size: number of items to choose to generate a sample
object: A bundle object.
...: Further parameters
keep.rownames: Required argument to safeguard consistency with S3 method definition in the data.table package. Unused in this context.
col: columns of the data.table to use to generate an object.

Slots

corpus: The CWB corpus the xobjects in the bundle are based on, a length 1 character vector.

objects

An object of class list.

p_attribute

Object of class character.

encoding

The encoding of the corpus.

Author

Andreas Blaette

Examples

Run this code

use("RcppCWB", "REUTERS")

# generate bundle with articles in REUTERS corpus
b <- partition_bundle("REUTERS", s_attribute = "id")

# basic operations
length(b)
names(b)
get_corpus(b)
summary(b)

# enrich with count for p-attribute
b <- enrich(b, p_attribute = "word")

# Indexing and accessing bundle objects
reu <- corpus("REUTERS") %>% split(s_attribute = "id")
reu[1:3]
reu[-1]
reu[-(1:10)]
reu["127"]
reu$`127` # alternative access
reu[c("127", "273")]
reu[["127"]] <- NULL
pb <- partition_bundle("GERMAPARLMINI", s_attribute = "party")
pb$"NA" <- NULL # quotation needed if name is "NA"

# Turn bundle into data.table (not tested to save time)
# \donttest{
dt <- partition_bundle("REUTERS", s_attribute = "id") %>%
  cooccurrences(query = "oil", cqp = FALSE) %>%
  as.data.table(col = "ll")
# }

Run the code above in your browser using DataLab