aodianyun/minicpm-v-26
Caption images and videos. Take an image or video plus an optional prompt and return text that describes the visual cont...
Found 166 models (showing 61-80)
Caption images and videos. Take an image or video plus an optional prompt and return text that describes the visual cont...
Generate text descriptions and answers from an input image or video. Accept an optional instruction or question prompt t...
Generate text descriptions for images and videos. Accepts a single image or video plus an optional instruction prompt, a...
Answer questions about images and generate detailed image captions using MiniGPT-4 with Vicuna-13B language model. Takes...
Generate text responses from prompts with advanced reasoning, code generation, and image analysis capabilities. Supports...
Generates text responses with built-in reasoning capabilities for complex problem-solving and expert-level analysis. Sup...
Generate text content from prompts with advanced reasoning and coding capabilities. Claude Sonnet 4 supports both standa...
Generate text responses based on prompts with support for image analysis. Features particularly strong capabilities in c...
Generate text content with structured outputs, web search capabilities, and custom tools based on text prompts and image...
Analyzes images and answers questions about them using MiniGPT-4 with Vicuna-7B language model. Takes an image and an op...
Answer questions about images and generate captions from an image and a text query, returning text. Accept a single imag...
Generate text from prompts or chat messages, with optional image inputs for visual understanding and captioning, and ret...
Generate text based on text prompts and optional image inputs. This multimodal language model handles both text and imag...
Generates text responses from text prompts and optional image inputs. Supports multimodal capabilities for analyzing and...
Generates text based on text prompts and optional image inputs. Handles multimodal tasks combining text and image analys...
Generate text and analyze images from a text prompt (optionally with an image), returning text for conversation, caption...
Caption images and answer visual questions from an input image and text prompt, returning text. Handle multilingual outp...
Answer questions about images and generate captions from an image input. Takes an image and a text prompt (e.g., βDescri...
Answer questions about images. Takes an image and a text prompt and returns a text response, enabling visual question an...
Caption images. Takes an input image and returns a short natural-language description as text, useful for alt text, acce...