Read in a Reuters-21578 XML document.
readReut21578XML(elem, language, id)
readReut21578XMLasPlain(elem, language, id)
An XMLTextDocument for readReut21578XML, or a PlainTextDocument for readReut21578XMLasPlain, representing the text and metadata extracted from elem$content.
elem: a named list with the component content, which must hold the document to be read in.
language: a string giving the language.
id: Not used.
Lewis, David (1997) Reuters-21578 Text Categorization Collection Distribution 1.0. https://archive.ics.uci.edu/ml/datasets/reuters-21578+text+categorization+collection
Reader for basic information on the reader infrastructure employed by package tm.
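
In practice these readers are usually handed to a corpus constructor rather than called directly. The following is a minimal sketch of that pattern, assuming package tm (with its XML parsing dependency) is installed and that the sample Reuters-21578 files shipped under texts/crude in the package are available. The variable names reut21578 and reuters are illustrative only.

```r
library(tm)

# Directory with the sample Reuters-21578 XML files bundled with tm
# (assumption: the installed tm version ships them under texts/crude).
reut21578 <- system.file("texts", "crude", package = "tm")

# Build a corpus, letting the reader turn each XML file into a
# PlainTextDocument. Swap in readReut21578XML to keep the XML structure.
reuters <- VCorpus(DirSource(reut21578, mode = "binary"),
                   readerControl = list(reader = readReut21578XMLasPlain))

# Inspect the first document and the metadata extracted by the reader.
inspect(reuters[[1]])
meta(reuters[[1]])
```

The corpus constructor supplies elem, language and id for each file, so the reader itself never needs to be invoked by hand.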