tm (version 0.3-3)

readDOC: Read In a MS Word Document

Description

Returns a function that reads in a Microsoft Word document and extracts its text.

Usage

readDOC(...)

Arguments

...
Arguments for the generator function.

Value

A function with the signature elem, language, load, id:

  • elem: A list with the two named elements content and uri. The first element must hold the document to be read in, the second element must hold a call to extract this document. The call is evaluated upon a request for load on demand.
  • language: A character vector giving the text's language.
  • load: A logical value indicating whether the document corpus should be immediately loaded into memory.
  • id: A character vector representing a unique identification string for the returned text document.

The function returns a PlainTextDocument representing the text in content.

Details

Formally, this function is a function generator: it returns a function (which reads in a text document) with a well-defined signature, and the returned function can access the arguments passed to the generator via lexical scoping. This is especially useful for reader functions for complex data structures that need many configuration options.
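
The generator pattern can be illustrated with a minimal base R sketch. It is not the tm implementation: make_toy_reader and its prefix argument are hypothetical names invented for this example, and a plain list stands in for the PlainTextDocument that a real reader returns.

```r
# Hypothetical generator, not part of tm: it mimics the reader signature
# (elem, language, load, id) using base R only.
make_toy_reader <- function(prefix = "", ...) {
  # 'prefix' is captured by the returned closure via lexical scoping,
  # so it does not have to appear in the reader's own signature.
  function(elem, language, load, id) {
    # 'load' is accepted for signature compatibility but unused here.
    # A real reader would build a PlainTextDocument; a list stands in for it.
    list(
      content  = paste0(prefix, elem$content),
      language = language,
      id       = id
    )
  }
}

# Configure once through the generator, then call the returned reader.
reader <- make_toy_reader(prefix = "[toy] ")
doc <- reader(
  elem     = list(content = "Hello, world.", uri = NULL),
  language = "en",
  load     = TRUE,
  id       = "doc-1"
)
doc$content
```

The configuration option lives in the generator call, while every reader produced this way keeps the same four-argument signature.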

Note that this MS Word reader requires the antiword tool to be installed and accessible on your system.
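
One way to check this requirement before reading documents is sketched below. It relies only on the base R function Sys.which and assumes the executable is named antiword on your platform; the variable names are illustrative.

```r
# Sys.which() returns an empty string for programs it cannot find on the PATH.
antiword_path <- unname(Sys.which("antiword"))
has_antiword  <- nzchar(antiword_path)

if (has_antiword) {
  message("antiword found at: ", antiword_path)
} else {
  message("antiword was not found on the PATH; install it before using the MS Word reader.")
}
```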

See Also

Use getReaders to list available reader functions.