spuuntries/urna-kp3l
Caption images and answer visual questions from an image and a text prompt. Accepts an input image and an instruction (e...
Found 52 models (showing 1-20)
Caption images and answer visual questions from an image and a text prompt. Accepts an input image and an instruction (e...
Segment objects in images from natural-language instructions and answer visual questions. Provide an image plus a text i...
Segment objects in images from natural-language instructions and answer grounded visual questions. Takes an image and a...
Caption images and answer visual questions from an input image. Optionally evaluate image-text matching. Provide an imag...
Answer questions about images and generate captions from a single input image. Provide an image and a natural-language q...
Answer questions about images. Provide an image and a natural-language question to receive a text answer, or switch to c...
Answer questions about images and generate image-grounded text from an image and a text prompt. Perform visual question...
Answer questions about images and generate captions from an input image and a text prompt, returning text. Handle genera...
Analyze images and answer questions from an image plus text prompt, returning text. Handle visual question answering (VQ...
Caption images and answer visual questions from an input image and text query, returning a text response. Handle general...
Answer questions about images and return text. Accepts an image and a natural-language question and outputs a textual an...
Caption images and answer questions about images. Takes an image and a text prompt as input and returns text, enabling i...
Answer questions about images and generate text descriptions. Accepts an image and a natural-language prompt; returns te...
Answer questions about images from a single image input and a text prompt, returning a single-turn text response. Perfor...
Answer questions about images and generate captions from an input image and text prompt. Output free-form text grounded...
Answer questions about images from an image and text prompt, returning text. Perform visual question answering, image ca...
Answer questions about an image and generate captions, returning text based on visual content. Provide a single image an...
Answer questions about images. Takes an image and a natural-language question and returns text, enabling visual question...
Answer questions about images and generate captions from an image and a text prompt, outputting text. Perform visual que...
Answer questions about images and documents from an image and a text prompt, returning text. Handle visual question answ...