Learn R Programming

tm.plugin.europresse (version 1.4)

readEuropresseHTML: Read in a Europresse article in the HTML format

Description

Read in an article exported from Europresse in the HTML format.

Usage

readEuropresseHTML1(elem, language, id) readEuropresseHTML2(elem, language, id)

Arguments

elem
A list with the named element content which must hold the document to be read in.
language
A character vector giving the text's language. If set to NA, the language will automatically be set to the value reported in the document (which is usually correct).
id
A character vector representing a unique identification string for the returned text document.

Value

A PlainTextDocument with the contents of the article and the available meta-data set.

Details

readEuropresseHTML1 reads documents in the old format, while readEuropresseHTML2 reads documents in the new one. EuropresseSource automatically chooses the correct reader based on the structure of the file.

See Also

getReaders to list available reader functions.