This function converts integer vectors, representing sequences of UTF-32 code points, to UTF-8 strings.
stri_enc_fromutf32(vec)
Returns a character vector (in UTF-8).
NULL
s in the input list are converted to NA_character_
.
a list of integer vectors (or objects coercible to such vectors)
or NULL
s. For convenience, a single integer vector can also
be given.
Marek Gagolewski and other contributors
UTF-32 is a 32-bit encoding where each Unicode code point corresponds to exactly one integer value.
This function is a vectorized version of
intToUtf8
. As usual in stringi,
it returns character strings in UTF-8.
See stri_enc_toutf32
for a dual operation.
If an ill-defined code point is given, a warning is generated
and the corresponding string is set to NA
.
Note that 0
s are not allowed in vec
, as they are used
internally to mark the end of a string (in the C API).
See also stri_encode
for decoding arbitrary byte sequences
from any given encoding.
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, tools:::Rd_expr_doi("10.18637/jss.v103.i02")
Other encoding_conversion:
about_encoding
,
stri_enc_toascii()
,
stri_enc_tonative()
,
stri_enc_toutf32()
,
stri_enc_toutf8()
,
stri_encode()