lucataco/qwen-vl-chat
Analyzes images and answers questions about them through conversational interaction. Takes an image and a text prompt as...
Found 82 models (showing 1-20)
Analyzes images and answers questions about them through conversational interaction. Takes an image and a text prompt as...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns a text response...
Analyze images to identify unusual or noteworthy elements based on textual prompts. This model processes an input image...
Analyzes images and answers questions about them using a visual language model. Takes an image and a text query as input...
Analyze images and answer questions about them in natural language. Accepts a text prompt and an optional image and retu...
Answer questions about images and perform visual reasoning from an image and a text prompt, returning text. Handle visua...
Analyze images and answer questions from an input image and text instruction, returning text. Support visual question an...
Generate text responses from prompts with advanced reasoning, code generation, and image analysis capabilities. Supports...
Generate text content from prompts with advanced reasoning and coding capabilities. Claude Sonnet 4 supports both standa...
Classify the safety of multimodal inputs (image and user message) for content moderation. Accepts an image (required) an...
Predicts the age of a person in an input image using CLIP by computing the similarity between age-related prompts and th...
Predicts age from an input image using CLIP model.
Generates text responses based on prompts or conversation messages, with support for image input analysis. This is the f...
Generates text responses based on prompts or multi-turn conversations, designed as a faster and more cost-effective vers...
Generate text from prompts or chat messages, with optional image inputs for visual understanding and captioning, and ret...
Generate and reason over text with optional image inputs, returning text outputs. Handle long-context tasks with a 200k-...
Analyze images and return text responses for captioning and visual question answering. Accept an image and a natural-lan...
Generates text responses from prompts using OpenAI's GPT-4o mini model with low latency and cost optimization. Supports...
Generates text responses based on prompts and can analyze images. Excels at coding tasks with state-of-the-art performan...
Generates text responses with built-in reasoning capabilities for complex problem-solving and expert-level analysis. Sup...