A character string specifying the path or URL to a PDF file.
doc
Optionally,, in lieu of file, an rJava reference to a PDDocument Java object.
pages
An optional integer vector specifying pages to extract from.
password
Optionally, a character string containing a user password to access a secured PDF.
copy
Specifies whether the original local file(s) should be copied to
tempdir() before processing. FALSE by default. The argument is
ignored if file is URL.
Value
For get_n_pages, an integer. For get_page_dims, a list of two-element numeric vectors specifying the width and height of each page, respectively.
Details
get_n_pages returns the page length of a PDF document. get_page_dims extracts the dimensions of specified pages in a PDF document. This can be useful for figuring out how to specify the area argument in extract_tables