Learn R Programming

utf8

utf8 is an R package for manipulating and printing UTF-8 text that fixes multiple bugs in R’s UTF-8 handling.

Installation

Stable version

utf8 is available on CRAN. To install the latest released version, run the following command in R:

Development version

To install the latest development version, run the following:

Usage

Validate character data and convert to UTF-8

Use as_utf8() to validate input text and convert to UTF-8 encoding. The function alerts you if the input text has the wrong declared encoding:

Normalize data

Use utf8_normalize() to convert to Unicode composed normal form (NFC). Optionally apply compatibility maps for NFKC normal form or case-fold.

Copy Link

Version

Install

install.packages('utf8')

Monthly Downloads

1,383,737

Version

1.2.3

License

Apache License (== 2.0) | file LICENSE

Issues

Pull Requests

Stars

Forks

Last Published

October 22nd, 2023

Functions in utf8 (1.2.3)

utf8-package

The utf8 Package
utf8_width

Measure the Character String Width
as_utf8

UTF-8 Character Encoding
utf8_encode

Encode Character Object as for UTF-8 Printing
output_utf8

Output Capabilities
utf8_normalize

Text Normalization
utf8_format

UTF-8 Text Formatting
utf8_print

Print UTF-8 Text