boilerpipeR-package: Extract the main content from HTML files
Description
boilerpipeR interfaces the boilerpipe Java library, created by Christian
Kohlschutter https://github.com/kohlschutter/boilerpipe. It implements robust heuristics
to extract the main content from HTML files, removing unessecary
elements like ads, banners and headers/footers.