lucataco/qwen-vl-chat
Answer questions about images. Accept an image and a text prompt and return text outputs for visual question answering,...
Found 67 models (showing 1-20)
Answer questions about images. Accept an image and a text prompt and return text outputs for visual question answering,...
Analyze images to identify unusual or noteworthy elements based on textual prompts. This model processes an input image...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns a text response...
Analyze images and answer questions about them in natural language. Accepts a text prompt and an optional image and retu...
Analyze images and videos to generate captions, answer visual questions, and summarize scenes. Accepts an image or a vid...
Analyze images and generate text responses to prompts. Accepts an image and a text prompt, and outputs text for visual q...
Answer questions about images. Takes an image and a text prompt, and returns a text answer, enabling visual question ans...
Analyze images and answer questions from an input image and text instruction, returning text. Support visual question an...
Segment objects in images from natural-language instructions and answer grounded visual questions. Takes an image and a...
Generate text and analyze images from prompts or multi-turn messages, returning text outputs. Accept multiple image inpu...
Generate and reason over text from prompts or multi-turn chat, with optional image inputs for vision understanding and i...
Generate and reason over text and code from a prompt, with optional image input for captioning and visual analysis, and...
Answer questions about images and GUI screenshots. Takes an image and a natural-language query and returns a text respon...
Generate text from prompts or chat messages, with optional image analysis for multimodal reasoning. Handle instruction f...
Generate text and analyze images for chat, coding, and reasoning. Accept text prompts or chat messages with optional ima...
Generate text from prompts or chat and analyze images to produce captions and grounded answers. Accepts text and optiona...
Generate and chat in natural language from text prompts, with optional image inputs for visual understanding and image-t...
Generate and reason over text from prompts and optional images. Accept text or chat-style messages and image inputs, and...
Solve complex reasoning tasks and generate text responses from prompts, multi-turn chat messages, and images. Accept a s...
Generate and analyze text and code from a prompt, with optional image input for visual understanding and data extraction...