Convert text to a sequence of words (or tokens).
text_to_word_sequence(
  text,
  filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
  lower = TRUE,
  split = " "
)
Returns: Words (or tokens).
text: Input text (string).
filters: Sequence of characters to filter out, such as punctuation. The default includes basic punctuation, tabs, and newlines.
lower: Whether to convert the input to lowercase.
split: Split marker (string) on which the text is divided into words. The default is a single space.
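
The way the arguments interact can be illustrated with a minimal base-R sketch. It is an approximation of the documented behaviour and not the package's implementation: the helper name `sketch_word_sequence` is hypothetical, and the sketch assumes that each filtered character is replaced by the split marker before the text is split and that empty pieces are dropped.

```r
# Hypothetical helper: a base-R approximation of the documented behaviour.
# It does not call keras and is not the library's implementation.
sketch_word_sequence <- function(text,
                                 filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
                                 lower = TRUE,
                                 split = " ") {
  # Optionally lowercase the input.
  if (lower) text <- tolower(text)

  # Assumption: every filtered character is replaced by the split marker,
  # so punctuation between words still separates them.
  filter_chars <- strsplit(filters, "")[[1]]
  for (ch in filter_chars) {
    text <- gsub(ch, split, text, fixed = TRUE)
  }

  # Split on the marker and drop the empty pieces left by repeated markers.
  words <- strsplit(text, split, fixed = TRUE)[[1]]
  words[nzchar(words)]
}

sketch_word_sequence("Hello, world! This is Keras.")
# [1] "hello" "world" "this"  "is"    "keras"
```

Setting `lower = FALSE` in the sketch keeps the original capitalisation, and passing a different `split` (for example `","`) changes both the replacement for filtered characters and the marker the text is divided on.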
Other text preprocessing: make_sampling_table(), pad_sequences(), skipgrams(), text_hashing_trick(), text_one_hot()