chr_unserialise_unicode

<a href="https://lifecycle.r-lib.org/articles/stages.html#experimental"><img src="figures/lifecycle-experimental.svg?package=rlang&version=1.1.0" alt="[Experimental]"></a>
For historical reasons, R translates strings to the native encoding
when they are converted to symbols. This string-to-symbol
conversion is not a rare occurrence and happens for instance to the
names of a list of arguments converted to a call by <code><a href="/link/do.call()?package=rlang&version=1.1.0" data-mini-rdoc="rlang::do.call()">do.call()</a></code>.
If the string contains unicode characters that cannot be
represented in the native encoding, R serialises those as an ASCII
sequence representing the unicode point. This is why Windows users
with western locales often see strings looking like <code>&lt;U+xxxx&gt;</code>. To
alleviate some of the pain, rlang parses strings and looks for
serialised unicode points to translate them back to the proper
UTF-8 representation. This transformation occurs automatically in
functions like <code>env_names()</code> and can be manually triggered with
<code>as_utf8_character()</code> and <code>chr_unserialise_unicode()</code>.

internal

A toolbox for working with base types, core R features
like the condition system, and core 'Tidyverse' features like tidy
evaluation.

Lionel Henry

rlang

Functions for Base Types and Core R and 'Tidyverse' Features

Hadley Wickham

mikefc 

Yann Collet

Posit, PBC 

chr_unserialise_unicode function

<dl><dt>chr</dt>
<dd>A character vector.</dd></dl>

Arguments

This function is experimental.

Life cycle

<a href='https://lifecycle.r-lib.org/articles/stages.html#experimental'><img src='figures/lifecycle-experimental.svg' alt='[Experimental]' /></a>
For historical reasons, R translates strings to the native encoding
when they are converted to symbols. This string-to-symbol
conversion is not a rare occurrence and happens for instance to the
names of a list of arguments converted to a call by <code><a href='https://rdrr.io/r/base/do.call.html'>do.call()</a></code>.
If the string contains unicode characters that cannot be
represented in the native encoding, R serialises those as an ASCII
sequence representing the unicode point. This is why Windows users
with western locales often see strings looking like <code>&lt;U+xxxx&gt;</code>. To
alleviate some of the pain, rlang parses strings and looks for
serialised unicode points to translate them back to the proper
UTF-8 representation. This transformation occurs automatically in
functions like <code>env_names()</code> and can be manually triggered with
<code>as_utf8_character()</code> and <code>chr_unserialise_unicode()</code>.

Translate unicode points to UTF-8 — chr_unserialise_unicode

<dl>

<dt>chr</dt>
<dd>A character vector.</dd>

</dl>

chr_unserialise_unicode: Translate unicode points to UTF-8

Description

Usage

Arguments

Life cycle

Examples