ibm-granite/granite-vision-3.2-2b
Extract content and answer questions from images of documents. Takes an image plus a text prompt or question and outputs...
Found 52 models (showing 41-52)
Extract content and answer questions from images of documents. Takes an image plus a text prompt or question and outputs...
Answer questions about images and generate captions from an image and a text query, returning text. Accept a single imag...
Answer questions about images. Takes an image and a text prompt and returns a text response, enabling visual question an...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns a text response...
Answer questions about images from a text prompt and an image input, returning a text response. Perform visual question...
Answer questions about images from an image and text prompt, returning text responses. Perform visual question answering...
Answer questions about images. Takes an image and a text prompt, and returns a text answer, enabling visual question ans...
Caption images and answer visual questions from an input image and text prompt. Accepts an image and a prompt; outputs t...
Answer questions about images and GUI screenshots. Takes an image and a natural-language query and returns a text respon...
Answer questions about images. Accept an image and a text prompt and return text outputs for visual question answering,...
Caption images and answer visual questions from an input image, returning text. Accept an image and an optional instruct...
Caption images and answer visual questions from an image plus an optional text prompt, returning text. Handle OCR-style...