- image
The image to generate text
- prompt
The prompt to generate text, Default is "Explain this image"
- model
The model to use. Options are "2.0-flash", "2.0-flash-lite", "2.5-pro-exp-03-25". Default is '2.0-flash'
see https://ai.google.dev/gemini-api/docs/models/gemini
- temperature
The temperature to use. Default is 1 value should be between 0 and 2
see https://ai.google.dev/gemini-api/docs/models/generative-models#model-parameters
- maxOutputTokens
The maximum number of tokens to generate.
Default is 8192 and 100 tokens correspond to roughly 60-80 words.
- topK
The top-k value to use. Default is 40 value should be between 0 and 100
see https://ai.google.dev/gemini-api/docs/models/generative-models#model-parameters
- topP
The top-p value to use. Default is 0.95 value should be between 0 and 1
see https://ai.google.dev/gemini-api/docs/models/generative-models#model-parameters
- seed
The seed to use. Default is 1234 value should be integer
see https://ai.google.dev/gemini-api/docs/models/generative-models#model-parameters
- type
The type of image. Options are 'png', 'jpeg', 'webp', 'heic', 'heif'. Default is 'png'