myEuropresseArticles.html
).
Frequently, it is necessary to specify the encoding of the texts via the encoding argument of EuropresseSource.
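As a minimal sketch of passing the encoding explicitly, the call below assumes that the exported file was saved as ISO-8859-1. That encoding name is only an illustrative assumption; use whichever encoding the file was actually saved in.

```r
library(tm)
library(tm.plugin.europresse)

# Assumption for illustration: the export is encoded as ISO-8859-1.
# Replace the value with the encoding your file really uses.
source <- EuropresseSource("myEuropresseArticles.html", encoding = "ISO-8859-1")
corpus <- Corpus(source)
```

If accented characters still look garbled when the articles are inspected, the chosen encoding probably does not match the file.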
# Import corpus
source <- EuropresseSource("myEuropresseArticles.html")
corpus <- Corpus(source)

# See how many articles were imported
corpus

# See the contents of the first article and its meta-data
inspect(corpus[1])
meta(corpus[[1]])
See EuropresseSource for more details and real examples.