Learn R Programming

keras (version 2.0.9)

text_one_hot: One-hot encode a text into a list of word indexes in a vocabulary of size n.

Description

One-hot encode a text into a list of word indexes in a vocabulary of size n.

Usage

text_one_hot(text, n, filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
  lower = TRUE, split = " ")

Arguments

text

Input text (string).

n

Size of vocabulary (integer)

filters

Sequence of characters to filter out.

lower

Whether to convert the input to lowercase.

split

Sentence split marker (string).

Value

List of integers in [1, n]. Each integer encodes a word (unicity non-guaranteed).

See Also

Other text preprocessing: make_sampling_table, pad_sequences, skipgrams, text_hashing_trick, text_to_word_sequence