Learn R Programming

tm.plugin.webmining (version 1.3)

WebCorpus: WebCorpus constructor function.

Description

WebCorpus adds further methods and meta data to Corpus and therefore constructs a derived class of Corpus. Most importantly, WebCorpus calls $PostFUN on the generated WebCorpus, which retrieves the main content for most implemented WebSources. Thus it enables an efficient retrieval of new feed items (corpus.update). All additional WebCorpus fields are added to tm$meta like $source, $readerControl and $postFUN.

Usage

WebCorpus(x, readerControl = list(reader = reader(x), language = "en"), postFUN = x$postFUN, retryEmpty = TRUE, ...)

Arguments

x
object of type Source, see also Corpus
readerControl
specifies reader to be used for Source, defaults to list(reader = x$DefaultReader, language = "en"
postFUN
function to be applied to WebCorpus after web retrieval has been completed, defaults to x$PostFUN
retryEmpty
specifies if retrieval for empty content elements should be repeated, defaults to TRUE
...
additional parameters for Corpus function (actually Corpus reader)