
qdap (version 0.2.5)

stemmer: Stem Text

Description

  • stemmer - Stems a vector of text strings.
  • stem.words - Wrapper for stemmer that stems a vector of words.
  • stem2df - Wrapper for stemmer that stems a vector of text strings and returns a dataframe with the stemmed vector added.

Usage

stemmer(text.var, rm.bracket = TRUE, capitalize = TRUE,
    warn = TRUE, char.keep = "~~", ...)

stem.words(...)

stem2df(dataframe, text.var, stem.name = NULL, ...)

Arguments

text.var
The text variable. For stemmer this is a vector of text strings. For stem2df this is a character vector of length one naming the text column.
rm.bracket
logical. If TRUE, brackets are removed from the text.
capitalize
logical. If TRUE, selected terms are capitalized.
warn
logical. If TRUE, warns about rows that do not end with standard qdap punctuation endmarks.
char.keep
A character vector of symbols that should be kept within sentences.
...
Various: stemmer - Other arguments passed to capitalizer. stem.words - Words or terms. stem2df - Other arguments passed to
dataframe
A dataframe object.
stem.name
A character vector of length one naming the stemmed column. If NULL, defaults to "stem.text".

Value

  • stemmer - returns a character vector with stemmed text.
  • stem.words - returns a vector of individually stemmed words.
  • stem2df - returns a dataframe with a character vector with stemmed text.
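
The stem2df pattern described above (stem the column named by text.var, then append the result under stem.name) can be sketched in base R. This is only an illustration under stated assumptions: toy_stem and toy_stem2df are hypothetical names, and the suffix-stripping rule is a crude stand-in, not the stemming algorithm qdap uses.

```r
# Toy stand-in for a real stemmer: strips a few common English suffixes.
# Hypothetical helper, NOT the algorithm qdap uses.
toy_stem <- function(words) {
  sub("(ing|ies|s)$", "", tolower(words))
}

# Hypothetical sketch of the stem2df pattern: stem the column named by
# text.var and append the result under stem.name ("stem.text" if NULL).
toy_stem2df <- function(dataframe, text.var, stem.name = NULL) {
  if (is.null(stem.name)) stem.name <- "stem.text"
  # Split each string on whitespace, stem word by word, then re-join.
  tokens <- strsplit(as.character(dataframe[[text.var]]), "\\s+")
  dataframe[[stem.name]] <- vapply(
    tokens,
    function(w) paste(toy_stem(w), collapse = " "),
    character(1)
  )
  dataframe
}

df <- data.frame(id = 1:2,
                 state = c("dogs jumping", "she swims"),
                 stringsAsFactors = FALSE)
out <- toy_stem2df(df, "state")
names(out)
```

The original columns are left untouched and the stemmed text is added as a new column, which mirrors the return value described for stem2df. Passing a third argument, for example toy_stem2df(df, "state", "new"), would name the added column "new" instead.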

See Also

capitalizer

Examples

#stemmer EXAMPLE:
stemmer(DATA$state)
out1 <- stemmer(raj$dialogue)
htruncdf(out1, 20, 60)

#stem.words EXAMPLE:
stem.words(doggies, jumping, swims)

#stem2df EXAMPLE:
out2 <- stem2df(DATA, "state", "new")
truncdf(out2, 30)
