
anthropic/claude-4-sonnet
Generate text and code from a prompt, with optional image input for captioning and visual analysis. Supports fast standa...
Found 65 models (showing 21-40)
Generate text and code from a prompt, with optional image input for captioning and visual analysis. Supports fast standa...
Generate text quickly for chat, question answering, code generation, classification, summarization, and translation. Acc...
Generate text for chat, coding, and reasoning from text prompts or multi-turn messages, with optional image input for an...
Solve complex reasoning tasks and generate text from prompts or chat messages. Accept text or messages and optional imag...
Generate and reason over text from a prompt, optionally analyze images to extract data and answer visual questions. Prod...
Answer questions about images and generate text from an image plus a prompt. Accepts a single image and textual instruct...
Generate captions for images using a simple GPT-5-mini wrapper. Input an image and receive a descriptive text output tha...
Answer questions about images and generate captions from an input image and text prompt. Accepts an image plus a message...
Generate text and code from prompts, with optional image analysis and visual question answering. Accepts a text prompt a...
Answer visual questions and caption images from a text prompt and an optional image, returning text. Support multi-turn...
Caption images and answer visual questions from an image and a text prompt. Accepts an image and a message (question or...
Answer questions about images and generate captions from an image and a natural-language query. Takes an image plus a te...
Analyze images in conversational chat to answer questions, caption scenes, and localize objects with bounding boxes. Acc...
Caption images. Takes a single image input and returns a concise natural-language description of the scene, suitable for...
Caption images and answer questions about an input image, returning text. Provide an image and an optional natural-langu...
Caption images. Take an image input and return a natural-language description of its contents as text (single caption)....
Generate captions for images by combining three input images using a mathematical operation. The model outputs text desc...
Caption images. Input a single image and generate a natural-language description using visual attention that focuses on...
Answer questions about images from a text prompt. Accepts an image and a natural-language prompt and returns generated t...
Generate fine-grained captions for images using a CLIP-based reward system. This model evaluates image captions based on...