Learn R Programming

tm.plugin.webmining (version 1.3)

readWeb: Read content from WebXMLSource/WebHTMLSource/WebJSONSource.

Description

readWeb is a FunctionGenerator which specifies content retrieval from a WebSource content elements. Currently, it is defined for XML, HTML and JSON feeds through readWebXML, readWebHTML and readWebJSON. Also content parsers (xml_content, json_content) need to be defined.

Usage

readWeb(spec, doc, parser, contentparser, freeFUN = NULL)

Arguments

spec
specification of content reader
doc
document to be parsed
parser
parser function to be used
contentparser
content parser function to be used, see also tm:::xml_content or json_content
freeFUN
function to free memory from parsed object (actually only relevant for XML and HTML trees)

Value

FunctionGenerator