Learn R Programming

polmineR (version 0.8.8)

p_attributes: Get p-attributes.

Description

In a CWB corpus, every token has positional attributes. While s-attributes cover a range of tokens, every single token in the token stream of a corpus will have a set of positional attributes (such as part-of-speech, or lemma). The available p-attributes are returned by the p_attributes-method.

Usage

p_attributes(.Object, ...)

# S4 method for character p_attributes(.Object, p_attribute = NULL)

# S4 method for corpus p_attributes(.Object, p_attribute = NULL)

# S4 method for slice p_attributes(.Object, p_attribute = NULL, decode = TRUE)

# S4 method for partition_bundle p_attributes(.Object, p_attribute = NULL, decode = TRUE)

# S4 method for remote_corpus p_attributes(.Object, ...)

# S4 method for remote_partition p_attributes(.Object, ...)

Arguments

.Object

A length-one character vector, or a partition object.

...

Arguments passed to get_token_stream.

p_attribute

A p-attribute to decode, provided by a length-one character vector.

decode

A length-one logical value. Whether to return decoded p-attributes or unique token ids.

Details

The p_attributes-method returns the p-attributes defined for the corpus the partition is derived from, if argument p_attribute is NULL (the default). If p_attribute is defined, the unique values for the p-attribute are returned.

References

Stefan Evert & The OCWB Development Team, CQP Query Language Tutorial, https://cwb.sourceforge.io/files/CQP_Tutorial.pdf.

Examples

Run this code
use(pkg = "RcppCWB", corpus = "REUTERS")

p_attributes("REUTERS")
p_attributes("REUTERS", p_attribute = "word")
merkel <- partition("GERMAPARLMINI", speaker = "Merkel", regex = TRUE)
merkel_words <- p_attributes(merkel, "word")

Run the code above in your browser using DataLab