Learn R Programming

tidytext (version 0.2.6)

parts_of_speech: Parts of speech for English words from the Moby Project

Description

Parts of speech for English words from the Moby Project by Grady Ward. Words with non-ASCII characters and items with a space have been removed.

Usage

parts_of_speech

Arguments

Format

A data frame with 205,985 rows and 2 variables:

word

An English word

pos

The part of speech of the word. One of 13 options, such as "Noun", "Adverb", "Adjective"

Details

Another dataset of English parts of speech, available only for non-commercial use, is available as part of SUBTLEXus at https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/subtlexus/.

Examples

Run this code
# NOT RUN {
library(dplyr)

parts_of_speech

parts_of_speech %>%
  count(pos, sort = TRUE)

# }

Run the code above in your browser using DataLab