Learn R Programming

ds4psy (version 0.7.0)

text_to_words: Split string(s) of text x into words.

Description

text_to_words splits a string of text x (consisting of one or more character strings) into a vector of its constituting words.

Usage

text_to_words(x)

Arguments

x

A string of text (required), typically a character vector.

Value

A character vector (of words).

Details

text_to_words removes all (standard) punctuation marks and empty spaces in the resulting text parts, before returning a vector of the remaining character symbols (as its words).

Internally, text_to_words uses strsplit to split strings at punctuation marks (split = "[[:punct:]]") and blank spaces (split = "( ){1,}").

See Also

text_to_sentences for splitting text into a vector of sentences; text_to_chars for splitting text into a vector of characters; count_words for counting the frequency of words; strsplit for splitting strings.

Other text objects and functions: Umlaut, capitalize(), caseflip(), cclass, count_chars_words(), count_chars(), count_words(), l33t_rul35, map_text_chars(), map_text_coord(), map_text_regex(), metachar, read_ascii(), text_to_chars(), text_to_sentences(), transl33t()

Examples

Run this code
# NOT RUN {
# Default: 
x <- c("Hello!", "This is a 1st sentence.", "This is the 2nd sentence.", "The end.")
text_to_words(x)

# }

Run the code above in your browser using DataLab