
spuuntries/urna-kp3l
Answer questions about an image and generate captions from a text prompt and image input. Accepts an image and a natural...
Found 43 models (showing 1-20)
Answer questions about an image and generate captions from a text prompt and image input. Accepts an image and a natural...
Segment objects and regions in images from natural-language instructions and answer visual questions. Takes an image and...
Segment objects and answer questions from an input image using natural-language instructions. Provide an image and an in...
Caption images from a single input image. Answer visual questions about the image and evaluate image-text matching by ch...
Answer questions about images and generate captions. Takes an image and a natural-language question as input and returns...
Answer questions about an input image and generate captions, returning text. Accept an image plus a question, or enable...
Answer questions about images. Accepts an image and a text prompt and returns text, enabling visual question answering,...
Caption images and answer visual questions from an input image and text prompt. Accepts an image and a natural-language...
Answer questions about images and produce text output. Handle image captioning, visual question answering (VQA), optical...
Caption images and answer visual questions from an input image and a text query. Return text responses for VQA, image de...
Answer questions about images and return text. Accepts an image and a natural-language question, and outputs textual res...
Caption images and answer visual questions from an image and a text prompt. Takes a single image plus an instruction (e....
Answer questions about images and generate captions from a text prompt. Accept a single image and a natural-language que...
Answer visual questions and caption images from an input image and text prompt, returning text. Perform single-turn visu...
Caption images and answer questions from an image and a text prompt, returning text. Handle visual question answering (V...
Answer questions about an image from a text prompt and return text. Perform visual question answering, image captioning,...
Answer questions about images. Accepts a single image and a text prompt, and outputs text that captions the image or res...
Answer questions about images. Takes an image and a natural-language question as input and returns a text answer. Suppor...
Answer questions about images and generate captions from an image and a text prompt. Accepts a single image plus a natur...
Answer questions about images and documents from an image and a text prompt, returning text. Handle visual question answ...