lidarbtc/kollava-v1.5
Answer questions about images in Korean. Take an image and a Korean prompt and generate Korean text for visual question...
Found 166 models (showing 41-60)
Answer questions about images and generate captions from an image input and a natural-language question, returning text....
Caption images and answer visual questions from an input image. Provide an image and either generate a caption or ask a...
Generate text prompts from an input image for use with text-to-image models. Analyze artists, mediums, and styles using...
Predict a person's age from an input image. Takes a photo containing a face and returns an estimated age (1–99) as text....
Answer questions about images from an image and text prompt, returning text. Perform visual question answering (VQA), im...
Generate text prompts from an input image. Combine CLIP and BLIP to analyze the image and produce descriptive prompts op...
Generate text responses with advanced reasoning capabilities, specializing in math, coding, and visual analysis. Process...
Identify bird species and answer bird-related questions from an input image and text prompt, returning text. Perform vis...
Answer questions about images and GUI screenshots. Takes an image and a natural-language query and returns a text respon...
Generate text responses from prompts with support for image analysis and visual understanding. Fast, lightweight languag...
Generates text responses from prompts using OpenAI's GPT-4o mini model with low latency and cost optimization. Supports...
Generates text responses from text prompts, messages, and images with multimodal capabilities. Processes both text and v...
Generate text responses for complex tasks with 1 million token context window and multimodal capabilities. Features impr...
Generate and reason over text with optional image inputs, returning text outputs. Handle long-context tasks with a 200k-...
Analyze images and answer questions about them in natural language. Accepts a text prompt and an optional image and retu...
Answer questions about images and generate captions from a text prompt and an optional image, returning text. Perform vi...
Analyze images and return text responses for captioning and visual question answering. Accept an image and a natural-lan...
Generates text descriptions, stories, and responses based on input images and prompts. Takes an image and text prompt as...
Answer questions about images. Accepts an image and a natural-language question and returns a text answer for visual que...