
cuuupid/glm-4v-9b
Answer questions about images and extract text from images. Takes an image and a text prompt and returns text, enabling...
Found 67 models (showing 41-60)
Caption images and answer visual questions from a text prompt and an optional image, returning text. Support long-contex...
Answer questions about images and extract information, returning text. Accepts an image plus a text prompt and outputs t...
Caption and answer questions about images. Accepts an image and a natural-language prompt, returning text descriptions o...
Generate text responses from prompts or chat messages, with optional image inputs for visual reasoning. Accepts text and...
Answer questions about an input image. Accepts an image and a natural-language question and returns a text answer, enabl...
Caption images. Provide an image and get a concise natural-language description of its contents for alt text, content ta...
Answer questions about images and generate captions and summaries. Accepts one image and a natural-language question and...
Caption images from a single input image. Answer visual questions about the image and evaluate image-text matching by ch...
Caption images and answer visual questions from an input image and a text query. Return text responses for VQA, image de...
Answer questions about images. Accepts an image and an optional text prompt and returns a text response for visual quest...
Generate and reason over text from prompts or chat messages, with optional image inputs, returning text outputs. Solve m...
Answer questions about images and generate captions from an image plus a text prompt, returning text. Analyze photos, do...
Caption images and answer visual questions from an input image, returning text. Accepts an image and a natural-language...
Generate text and understand images from text and optional image inputs. Handle chat, question answering, document summa...
Answer questions about images and documents from an image and a text prompt, returning text. Handle visual question answ...
Caption images and answer visual questions from an input image, returning text. Accept an image plus an instruction prom...
Generate and converse with text from prompts. Accepts text input (and optional images) and returns text for chat, questi...
Generate text and analyze images from prompts or chat messages, optimized for low latency and cost. Accepts text and opt...
Generate and reason over text from prompts, with optional image analysis. Accepts text and an optional image, and return...