stri_extract_all_boundaries

stri_extract_last_boundaries

stri_extract_first_boundaries

stri_extract_all_words

stri_extract_first_words

stri_extract_last_words

character vector or an object coercible to

single logical value;
if <code>TRUE</code> or <code>NA</code>, then a character matrix is returned;
otherwise (the default), a list of character vectors is given, see Value

simplify

single logical value; if <code>FALSE</code>,
then a missing value will indicate that there are no words

omit_no_match

additional settings for <code>opts_brkiter</code>

a named list with ICU BreakIterator's settings,
see <code><a rd-options="" href="/link/stri_opts_brkiter?package=stringi&version=1.5.3" data-mini-rdoc="stringi::stri_opts_brkiter">stri_opts_brkiter</a></code>;
<code>NULL</code> for the default break iterator, i.e., <code>line_break</code>

opts_brkiter

<code>NULL</code> or <code>''</code> for text boundary analysis following
the conventions of the default locale, or a single string with
locale identifier, see <a rd-options="" href="/link/stringi-locale?package=stringi&version=1.5.3" data-mini-rdoc="stringi::stringi-locale">stringi-locale</a>

locale

These functions extract data between text boundaries.

A multitude of character string/text/natural language
processing tools: pattern searching (e.g., with 'Java'-like regular
expressions or the 'Unicode' collation algorithm), random string generation,
case mapping, string transliteration, concatenation, sorting, padding,
wrapping, Unicode normalisation, date-time formatting and parsing,
and many more. They are fast, consistent, convenient, and -
owing to the use of the 'ICU' (International Components for Unicode)
library - portable across all locales and platforms.

Marek Gagolewski

stringi

Character String Processing Facilities

stri_extract_all_boundaries function

a named list with ICU BreakIterator's settings,
see <code><a rd-options='' href='stri_opts_brkiter'>stri_opts_brkiter</a></code>;
<code>NULL</code> for the default break iterator, i.e., <code>line_break</code>

<code>NULL</code> or <code>''</code> for text boundary analysis following
the conventions of the default locale, or a single string with
locale identifier, see <a rd-options='' href='stringi-locale'>stringi-locale</a>

stri_extract_all_boundaries: Extract Data Between Text Boundaries

Description

Usage

Arguments

Value

Details

See Also

Examples