Learn R Programming

tau (version 0.0-26)

ligatures: Translate Unicode Latin Ligatures

Description

Translate Unicode “Latin ligature” characters to their respective constituents.

Usage

translate_Unicode_latin_ligatures(x)

Arguments

x

a character vector in UTF-8 encoding.

Details

In typography, a ligature occurs where two or more graphemes are joined as a single glyph. (See https://en.wikipedia.org/wiki/Typographic_ligature for more information.)

Unicode (http://www.unicode.org/) lists the following “Latin” ligatures:

CodeName
0132LATIN CAPITAL LIGATURE IJ
0133LATIN SMALL LIGATURE IJ
0152LATIN CAPITAL LIGATURE OE
0153LATIN SMALL LIGATURE OE
FB00LATIN SMALL LIGATURE FF
FB01LATIN SMALL LIGATURE FI
FB02LATIN SMALL LIGATURE FL
FB03LATIN SMALL LIGATURE FFI
FB04LATIN SMALL LIGATURE FFL
FB05LATIN SMALL LIGATURE LONG S T
FB06LATIN SMALL LIGATURE ST

translate_Unicode_latin_ligatures translates these to their respective constituent characters.