- .llm
An LLMMessage
object containing the conversation history.
- .model
The identifier of the model to use (default: "llama-3.2-11b-vision-preview").
- .max_tokens
The maximum number of tokens that can be generated in the response (default: 1024).
- .temperature
Controls the randomness in the model's response. Values between 0 and 2 are allowed, where higher values increase randomness (optional).
- .top_p
Nucleus sampling parameter that controls the proportion of probability mass considered. Values between 0 and 1 are allowed (optional).
- .frequency_penalty
Number between -2.0 and 2.0. Positive values penalize repeated tokens, reducing likelihood of repetition (optional).
- .presence_penalty
Number between -2.0 and 2.0. Positive values encourage new topics by penalizing tokens that have appeared so far (optional).
- .stop
One or more sequences where the API will stop generating further tokens. Can be a string or a list of strings (optional).
- .seed
An integer for deterministic sampling. If specified, attempts to return the same result for repeated requests with identical parameters (optional).
- .api_url
Base URL for the Groq API (default: "https://api.groq.com/").
- .json
Whether the response should be structured as JSON (default: FALSE).
- .timeout
Request timeout in seconds (default: 60).
- .verbose
If TRUE, displays additional information after the API call, including rate limit details (default: FALSE).
- .stream
Logical; if TRUE, streams the response piece by piece (default: FALSE).
- .dry_run
If TRUE, performs a dry run and returns the constructed request object without executing it (default: FALSE).
- .max_tries
Maximum retries to peform request