
ibm-granite/granite-vision-3.2-2b
Extract and reason over content in document images. Accepts an image and a text prompt or question, and returns text ans...
Found 46 models (showing 41-46)
Extract and reason over content in document images. Accepts an image and a text prompt or question, and returns text ans...
Answer questions about images and generate captions from an image and a natural-language query. Takes an image plus a te...
Answer questions about images. Accept an image and a natural-language prompt and return text, enabling visual question a...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns text, enabling...
Answer questions about images (visual question answering) from an image and a text prompt, returning text. Use a localit...
Answer questions about images from a text prompt and an image input. Accepts an image and instruction, and returns text...