
lidarbtc/kollava-v1.5
Answer questions about images in Korean. Takes an image and a Korean text prompt and returns Korean text, supporting vis...
Found 46 models (showing 21-40)
Answer questions about images. Provide an image and a natural-language question and receive a text response. Handle gene...
Caption images and answer questions about images. Takes an image plus an optional question and prior Q/A context and ret...
Answer questions about an input image and generate captions. Takes an image plus a text prompt and returns a text respon...
Identify bird species and answer bird-related questions from an input image and text prompt. Accepts a bird photo and a...
Answer questions about images and generate text from an image plus a prompt. Accepts a single image and textual instruct...
Answer questions about an input image. Accepts an image and a natural-language question and returns a text answer, enabl...
Answer questions about images and generate captions from an input image and text prompt. Accepts an image plus a message...
Caption images and answer visual questions from an image and a text prompt. Accepts an image and a message (question or...
Caption images and answer visual questions from an image plus a text prompt, returning text. Support multilingual prompt...
Caption images and answer questions about images. Accepts an image and a natural-language prompt (e.g., “Describe this i...
Answer questions about images from an image and text prompt, returning text. Support visual question answering, image ca...
Answer questions about images from an image and a text prompt, returning text. Perform visual question answering and gro...
Answer questions about images from a text prompt. Accepts an image and a natural-language prompt and returns generated t...
Answer questions about images and documents and generate captions from an image plus a text prompt, returning text. Sele...
Answer questions and caption images from one to three input images, returning text. Handle visual question answering (VQ...
Answer questions about images and generate captions and summaries. Accepts one image and a natural-language question and...
Caption images and answer visual questions from an input image, returning text. Accepts an image and a natural-language...
Caption images and answer visual questions from an image and a text prompt, returning text. Add visual understanding to...
Extract structured information and answer questions from documents, charts, tables, diagrams, and general images. Accept...