Reads a text file in its entirety, re-encodes it, and splits it into text lines.
stri_read_lines(con, encoding = NULL, fname = con, fallback_encoding = NULL)
name of the input file or a connection object (opened in the binary mode)
single string; input encoding; NULL or '' for the current default encoding.
deprecated alias of con
deprecated argument, no longer used
Returns a character vector in which each text line is a separate string. The output is always marked as UTF-8.
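A minimal usage sketch, assuming the stringi package is installed. The temporary file, its Polish sample text, and the choice of ISO-8859-2 as the input encoding are illustrative assumptions, not part of the original documentation.

```r
library(stringi)

# Illustrative sample data: two lines of non-ASCII text (an assumption).
txt <- c("za\u017c\u00f3\u0142\u0107", "g\u0119\u015bl\u0105 ja\u017a\u0144")

# Store the text as ISO-8859-2 bytes in a temporary file.
path <- tempfile(fileext = ".txt")
bytes <- stri_encode(paste(txt, collapse = "\n"),
                     from = "UTF-8", to = "ISO-8859-2", to_raw = TRUE)[[1]]
writeBin(bytes, path)

# Read the file back, declaring its encoding explicitly.
lines <- stri_read_lines(path, encoding = "ISO-8859-2")

print(lines)            # one string per text line
print(Encoding(lines))  # encoding marks of the result
```

Passing the encoding explicitly avoids relying on the current default encoding, which may differ between platforms.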
This aims to be a substitute for the readLines function, with the ability to re-encode the input file in a much more robust way, and split the text into lines with stri_split_lines1 (which conforms with the Unicode guidelines for newline markers).
The function calls stri_read_raw, stri_encode, and stri_split_lines1, in this order.
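The three-step pipeline described above can be sketched by hand as follows. This is an approximation under stated assumptions, not the package's actual source: read_lines_sketch is a hypothetical helper name, and the sample file contents are made up for illustration.

```r
library(stringi)

# Hypothetical helper mirroring the documented call order (an assumption,
# not the real implementation of stri_read_lines).
read_lines_sketch <- function(path, encoding = NULL) {
  raw_bytes <- stri_read_raw(path)                        # 1. whole file as a raw vector
  text <- stri_encode(raw_bytes, from = encoding, to = "UTF-8")  # 2. re-encode to UTF-8
  stri_split_lines1(text)                                 # 3. split at newline markers
}

# Illustrative input mixing CRLF and LF line endings.
path <- tempfile(fileext = ".txt")
writeBin(charToRaw("alpha\r\nbeta\ngamma"), path)

manual <- read_lines_sketch(path, encoding = "UTF-8")
builtin <- stri_read_lines(path, encoding = "UTF-8")
print(identical(manual, builtin))
```

Splitting with stri_split_lines1 rather than on "\n" alone is what lets mixed line endings in a single file be handled uniformly.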
Because of the way this function is currently implemented, the input file size cannot exceed ~0.67 GB.
Other files: stri_read_raw(), stri_write_lines()