This tokenizer uses stringi::stri_split_boundaries() to split a character vector into tokens. It is intended for use with explain.character().
default_tokenize(text)
text: the text to tokenize, as a character vector.
A character vector of tokens.
# NOT RUN {
data('train_sentences')
default_tokenize(train_sentences$text[1])
# }
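To illustrate the kind of splitting that stringi::stri_split_boundaries() performs, here is a minimal sketch. The helper name sketch_tokenize is hypothetical, and the options type = "word" and skip_word_none = TRUE are assumptions chosen for illustration; the text above does not say which options default_tokenize() actually passes.

```r
library(stringi)

# Hypothetical helper for illustration only; not part of the documented API.
sketch_tokenize <- function(text) {
  # type = "word" breaks the text at word boundaries (assumed option);
  # skip_word_none = TRUE drops pieces that are not words, such as
  # spaces and punctuation (assumed option).
  pieces <- stri_split_boundaries(text, type = "word", skip_word_none = TRUE)
  # stri_split_boundaries() returns a list with one element per input
  # string, so flatten it into a single character vector.
  unlist(pieces)
}

tokens <- sketch_tokenize("This is a short sentence.")
print(tokens)
```

Flattening with unlist() merges the tokens of all input strings into one vector, which suits the single-string case shown here; keep the list form if you need to know which input each token came from.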