Learn R Programming

textTools (version 0.1.0)

ngrams: Create n-grams

Description

Create n-grams

Usage

ngrams(
  x,
  text,
  group_by = c(),
  count_col_name = "count",
  n,
  ngram_prefix = NULL
)

Arguments

x

A text.table created by as.text.table().

text

A string, the name of the column in x to build n-grams with.

group_by

A vector of column names to group by. Doesn't work if the group by column is a list column.

count_col_name

A string, the name of the output column containing the number of times each base record appears in the group.

n

A integer, the number of grams to make.

ngram_prefix

A string, a prefix to add to the output n-gram columns.

Value

A text.table, with columns added for n-grams (the word, the count, and percent of the time the gram follows the word).

Examples

Run this code
# NOT RUN {
ngrams(
as.text.table(
  x = as.data.table(
    list(
      col1 = c(
        "a",
        "b"
      ),
      col2 = c(
        tolower("The dog is nice because it picked up the newspaper."),
        tolower("The dog is extremely nice because it does the dishes.")
      )
    )
  ),
  text = "col2",
  split = " "
),
text = "col2",
group_by = "col1",
n = 2
)
# }

Run the code above in your browser using DataLab