sai88uk/minicpm-v-45-v9
Answer questions about images and videos. Accepts an image or a video plus a question and returns text, enabling visual...
Answer questions about images and videos. Accepts an image or a video plus a question and returns text, enabling visual...
Generate text from prompts or chat messages, with optional image inputs for multimodal reasoning and captioning. Accept...
Analyze images and return text responses for captioning and visual question answering. Accept an image and a natural-lan...
Generate and reason over text, with optional image input for visual understanding and captioning. Takes a text prompt an...
Caption images and answer questions about images. Takes an image plus an optional question and prior Q/A context and ret...
Answer questions about images with step-by-step reasoning. Take an image and an optional text prompt and output text, in...
Answer questions about images from a text prompt and an image input. Accepts an image and instruction, and returns text...
Generate structured JSON or free-form text from prompts. Accept text and image inputs to analyze visuals and return capt...
Generate detailed textual descriptions of images based on input prompts. Uses a vision-language model to analyze an...
Answer questions about images and generate captions from an image and a text prompt, returning text. Perform visual ques...