
yorickvp/llava-v1.6-mistral-7b
Answer visual questions and caption images from a text prompt and an optional image, returning text. Support multi-turn...
Found 45 models (showing 21-40)
Answer visual questions and caption images from a text prompt and an optional image, returning text. Support multi-turn...
Analyze images and return text responses for captioning and visual question answering. Accept an image and a natural-lan...
Generate captions for images using a simple GPT-5-mini wrapper. Input an image and receive a descriptive text output tha...
Generate text and code from prompts, with optional image analysis and visual question answering. Accepts a text prompt a...
Answer questions about images. Takes an image and a natural-language question as input and returns a text answer. Suppor...
Generate text and understand images from text and optional image inputs. Handle chat, question answering, document summa...
Analyze images in conversational chat to answer questions, caption scenes, and localize objects with bounding boxes. Acc...
Answer questions about images and produce text output. Handle image captioning, visual question answering (VQA), optical...
Caption images and answer visual questions from a text prompt and an optional image, returning text. Support long-contex...
Answer questions about images and extract information, returning text. Accepts an image plus a text prompt and outputs t...
Caption and answer questions about images. Accepts an image and a natural-language prompt, returning text descriptions o...
Caption images and answer visual questions from an input image and a text query. Return text responses for VQA, image de...
Answer visual questions and caption images from an input image and text prompt, returning text. Perform single-turn visu...
Answer questions about images. Accepts an image and an optional text prompt and returns a text response for visual quest...
Answer questions about images and generate captions from an image plus a text prompt, returning text. Analyze photos, do...
Caption images and answer visual questions from an image and a text prompt, returning text. Add visual understanding to...
Answer questions about images. Accepts an image and a text prompt and returns text, enabling visual question answering,...
Generate text and analyze images from text and image inputs. Handle question answering, long-context reasoning (128K tok...
Answer questions about images and documents from an image and a text prompt, returning text. Handle visual question answ...
Caption images and answer visual questions from an input image, returning text. Accept an image plus an instruction prom...