stri_extract_all_boundaries

stri_extract_last_boundaries

stri_extract_first_boundaries

stri_extract_all_words

stri_extract_first_words

stri_extract_last_words

character vector or an object coercible to

single logical value;
if <code>TRUE</code> or <code>NA</code>, then a character matrix is returned;
otherwise (the default), a list of character vectors is given, see Value

simplify

single logical value; if <code>FALSE</code>,
then a missing value will indicate that there are no words

omit_no_match

additional settings for <code>opts_brkiter</code>

a named list with ICU BreakIterator's settings,
see <code><a rd-options="" href="/link/stri_opts_brkiter?package=stringi&version=1.3.1" data-mini-rdoc="stringi::stri_opts_brkiter">stri_opts_brkiter</a></code>;
<code>NULL</code> for the default break iterator, i.e., <code>line_break</code>

opts_brkiter

<code>NULL</code> or <code>""</code> for text boundary analysis following
the conventions of the default locale, or a single string with
locale identifier, see <a rd-options="" href="/link/stringi-locale?package=stringi&version=1.3.1" data-mini-rdoc="stringi::stringi-locale">stringi-locale</a>

locale

These functions extract data between text boundaries.

Fast, correct, consistent, portable,
as well as convenient character string/text processing in every locale
and any native encoding. Owing to the use of the 'ICU'
(International Components for Unicode) library,
the package provides 'R' users with platform-independent functions
known to 'Java', 'Perl', 'Python', 'PHP', and 'Ruby' programmers. Available
features include: pattern searching (e.g., with 'Java'-like regular
expressions or the 'Unicode' collation algorithm), random string generation,
case mapping, string transliteration, concatenation,
Unicode normalization, date-time formatting and parsing, and many more.

Marek Gagolewski

stringi

Character String Processing Facilities

stri_extract_all_boundaries function

a named list with ICU BreakIterator's settings,
see <code><a rd-options='' href='stri_opts_brkiter'>stri_opts_brkiter</a></code>;
<code>NULL</code> for the default break iterator, i.e., <code>line_break</code>

<code>NULL</code> or <code>""</code> for text boundary analysis following
the conventions of the default locale, or a single string with
locale identifier, see <a rd-options='' href='stringi-locale'>stringi-locale</a>

stri_extract_all_boundaries: Extract Data Between Text Boundaries

Description

Usage

Arguments

Value

Details

See Also

Examples