corpora (version 0.6)

PassiveBrownFam: By-text frequencies of passive verb phrases in the Brown Family corpora.

Description

This data set specifies the number of passive and active verb phrases for each text in the extended Brown Family of corpora (Brown, LOB, Frown, FLOB, BLOB), covering edited written American and British English from the 1930s, 1960s and 1990s (see Xiao 2008, 395–397).

Verb phrase counts and their classification as passive or active voice are based on a fully automatic analysis of the texts, using the Pro3Gres parser (Schneider et al. 2004).

Usage

PassiveBrownFam

Format

A data frame with 2499 rows and the following 11 columns:

id:

A unique ID for each text (also used as row name)

corpus:

Corpus, a factor with five levels BLOB, Brown, LOB, Frown, FLOB

section:

Genre, a factor with fifteen levels A, ..., R (Brown section codes)

genre:

Genre labels, a factor with fifteen levels (e.g. press reportage)

period:

Date of publication, a factor with three levels (1930, 1960, 1990)

lang:

Language variety / region, a factor with levels AmE (U.S.) and BrE (UK)

n.words:

Number of word tokens, an integer vector

act:

Number of active verb phrases, an integer vector

pass:

Number of passive verb phrases, an integer vector

verbs:

Total number of verb phrases, an integer vector

p.pass:

Percentage of passive verb phrases in the text, a numeric vector
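
The relationship between the count columns can be illustrated with a minimal sketch. It uses a toy data frame whose values and IDs are invented for illustration and are not taken from PassiveBrownFam. It also assumes that verbs is the sum of act and pass, which this page suggests but does not state explicitly.

```r
# Toy data frame mimicking some of the documented columns.
# All IDs and counts are hypothetical; they are NOT real PassiveBrownFam values.
toy <- data.frame(
  id      = c("toy_A01", "toy_A02", "toy_J01"),
  corpus  = factor(c("Brown", "Brown", "LOB")),
  section = factor(c("A", "A", "J")),
  n.words = c(2000L, 2100L, 1950L),
  act     = c(180L, 190L, 120L),
  pass    = c(20L, 10L, 40L),
  stringsAsFactors = FALSE
)
rownames(toy) <- toy$id  # the ID doubles as row name, as documented

# Assumption: total verb phrases = active + passive
toy$verbs <- toy$act + toy$pass

# Percentage of passive verb phrases in each text
toy$p.pass <- 100 * toy$pass / toy$verbs

# Pooled percentage per corpus: sum the counts first, then divide.
# This weights each text by its number of verb phrases, unlike mean(p.pass).
pooled <- aggregate(cbind(pass, verbs) ~ corpus, data = toy, FUN = sum)
pooled$p.pass <- 100 * pooled$pass / pooled$verbs
print(pooled)
```

The same expressions should apply to PassiveBrownFam itself once the corpora package is loaded. Whether the stored p.pass column matches 100 * pass / verbs exactly is worth checking on the real data rather than taking for granted.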

Acknowledgements

Frequency information for this data set was kindly provided by Gerold Schneider, University of Zurich (http://www.cl.uzh.ch/de/people/team/compling/gschneid.html).

Author

Stephanie Evert (https://purl.org/stephanie.evert)

Details

No frequency data could be obtained for text N02 in the Frown corpus. This entry has been omitted from the table.
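
The omission is consistent with the row count given under Format, as the following sketch shows. It assumes that each of the five corpora contains 500 texts. That is the usual size of a Brown Family corpus, but it is not stated on this page.

```r
# Assumption (not stated on this page): each corpus contains 500 texts.
n.corpora        <- 5L    # BLOB, Brown, LOB, Frown, FLOB
texts.per.corpus <- 500L  # assumed standard Brown Family size
n.omitted        <- 1L    # text N02 of Frown

expected.rows <- n.corpora * texts.per.corpus - n.omitted
print(expected.rows)  # 2499, matching the documented number of rows

# Optional check against the real data, only if the corpora package is installed
if (requireNamespace("corpora", quietly = TRUE)) {
  suppressWarnings(data("PassiveBrownFam", package = "corpora"))
  if (exists("PassiveBrownFam")) {
    # Under the assumption above, Frown should have one text fewer than the others
    print(table(PassiveBrownFam$corpus))
  }
}
```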

References

Schneider, Gerold; Rinaldi, Fabio; Dowdall, James (2004). Fast, deep-linguistic statistical dependency parsing. In G.-J. M. Kruijff and D. Duchier (eds.), Proceedings of the COLING 2004 Workshop on Recent Advances in Dependency Grammar, pages 33–40, Geneva, Switzerland. https://files.ifi.uzh.ch/cl/gschneid/parser/

Xiao, Richard (2008). Well-known and influential corpora. In A. Lüdeling and M. Kytö (eds.), Corpus Linguistics. An International Handbook, chapter 20, pages 383–457. Mouton de Gruyter, Berlin.