Tabula is a Java library designed to computationally extract tables from PDF documents. tabulizer provides a thin R package with bindings to the library. It presently offers two principal functions: extract_tables
, which mimics the command line functionality of Tabula, and extract_areas
which provides an interactive interface to the former.