pdf_page_batch

This function processes a PDF file page by page. For each page, it extracts the text
and converts the page into an image. It creates a list of LLMMessage objects with
the text and the image for multimodal processing. Users can specify a range of pages
to process and provide a custom function to generate prompts for each page.

A tidy interface for integrating large language model (LLM) APIs such as 'Claude', 'Openai', 'Groq','Mistral' and local models via 'Ollama' into R workflows. The package supports text and media-based interactions, interactive message history, batch request APIs, and a tidy, pipeline-oriented interface for streamlined integration into data workflows. Web services are available at <https://www.anthropic.com>, <https://openai.com>, <https://groq.com>, <https://mistral.ai/> and <https://ollama.com>.

Eduard Brüll

tidyllm

Tidy Integration of Large Language Models

pdf_page_batch function

<dl><dt>.pdf</dt>
<dd>Path to the PDF file.</dd>
<dt>.general_prompt</dt>
<dd>A default prompt that is applied to each page if <code>.prompt_fn</code> is not provided.</dd>
<dt>.system_prompt</dt>
<dd>Optional system prompt to initialize the LLMMessage (default is "You are a helpful assistant").</dd>
<dt>.page_range</dt>
<dd>A vector of two integers specifying the start and end pages to process. If NULL, all pages are processed.</dd>
<dt>.prompt_fn</dt>
<dd>An optional custom function that generates a prompt for each page. The function takes the page text as input
and returns a string. If NULL, <code>.general_prompt</code> is used for all pages.</dd></dl>

Arguments

Batch Process PDF into LLM Messages — pdf_page_batch

<dl>

<dt>.pdf</dt>
<dd>Path to the PDF file.</dd>


<dt>.general_prompt</dt>
<dd>A default prompt that is applied to each page if <code>.prompt_fn</code> is not provided.</dd>


<dt>.system_prompt</dt>
<dd>Optional system prompt to initialize the LLMMessage (default is "You are a helpful assistant").</dd>


<dt>.page_range</dt>
<dd>A vector of two integers specifying the start and end pages to process. If NULL, all pages are processed.</dd>


<dt>.prompt_fn</dt>
<dd>An optional custom function that generates a prompt for each page. The function takes the page text as input
and returns a string. If NULL, <code>.general_prompt</code> is used for all pages.</dd>

</dl>

pdf_page_batch: Batch Process PDF into LLM Messages

Description

Usage

Value

Arguments