image-analysis AI Models - Page 2

peter65374/cog-resnet

Classify images by identifying objects and assigning confidence scores to each detected object.

🖼️ • image-classification • object-detection • image-analysis • 11 runs

yorickvp/llava-v1.6-mistral-7b

Multimodal language model that analyzes images and generates text responses based on visual content and text prompts. Bu...

🖼️ → 📝 • image-to-text • text-generation • visual-understanding • 5.0M runs

openai/gpt-4.1-nano

Generates text responses from prompts with ultra-low latency and fast response times. Supports up to 1 million token con...

🖼️ → 📝 • text-generation • image-to-text • text-embedding • 2.3M runs

yimi81/yi-vl-6b

Answer questions about images and generate captions from an image and a text query, returning text. Accept a single imag...

🖼️ → 📝 • image-to-text • visual-question-answering • image-captioning • 309 runs

scamai/deepfake-faceswap-detection

Detect the likelihood of deepfake faceswaps in images. This model focuses on identifying faceswaps with high confidence,...

🖼️ • deepfake-detection • faceswap-detection • image-analysis • 94 runs

paragekbote/gemma3-torchao-quant-sparse

Generate text and analyze images from a text prompt (optionally with an image), returning text for conversation, caption...

🖼️ → 📝 • text-generation • image-to-text • image-captioning • 54 runs

jyoung105/imp

Answer questions about images. Takes an image and a text prompt and returns a text response, enabling visual question an...

🖼️ → 📝 • image-to-text • visual-question-answering • 71 runs

bxclib2/test-input-file

Compute an integer from an input image. Accepts an image and outputs a numeric value, useful for testing image input pip...

🖼️ • image-metadata • 43 runs

j-min/clip-caption-reward

Generate fine-grained captions for images using a CLIP-based reward system. This model evaluates image captions based on...

🖼️ • image-captioning • clip-reward • fine-grained-captioning • 296.1K runs

deepseek-ai/deepseek-vl2-small

Analyze images and answer questions about visual content using a Mixture-of-Experts vision-language model. Takes an imag...

🖼️ → 📝 • image-to-text • ocr • text-generation • 6.5K runs

zsxkib/kimi-vl-a3b-thinking

Answer questions about images and text with multimodal reasoning. Takes a text prompt with an optional image and outputs...

🖼️ → 📝 • image-to-text • text-generation • image-analysis • 988 runs

openai/gpt-4o

Generates text responses from text prompts, messages, and images with multimodal capabilities. Processes both text and v...

🖼️ → 📝 • text-generation • image-to-text • code-generation • 723.2K runs

openai/o4-mini

Generate text responses with advanced reasoning capabilities, specializing in math, coding, and visual analysis. Process...

🖼️ → 📝 • text-generation • image-to-text • code-generation • 470.2K runs

zsxkib/molmo-7b

Answers questions and generates captions about images using a 7B parameter vision-language model. Based on Qwen2-7B and...

🖼️ → 📝 • image-to-text • visual-understanding • image-analysis • 1.3M runs

lucataco/llama-3-vision-alpha

Analyzes images and generates text descriptions or answers questions about visual content. Uses a projection module trai...

🖼️ → 📝 • image-to-text • text-generation • visual-understanding • 6.8K runs

nomagick/qwen-vl-chat

Generates text responses based on text prompts and images with ChatML prompt interface and streaming support. Accepts up...

🖼️ → 📝 • text-generation • image-to-text • image-analysis • 1.1K runs

openai/gpt-4.1-mini

Generate text responses from prompts with support for image analysis and visual understanding. Fast, lightweight languag...

🖼️ → 📝 • text-generation • image-to-text • code-generation • 2.6M runs

google-deepmind/gemma-3-4b-it

Generate text based on text prompts and optional image inputs. This multimodal language model handles both text and imag...

🖼️ → 📝 • text-generation • image-to-text • code-generation • 13.3K runs

lucataco/smolvlm-instruct

Analyzes images and generates text responses based on visual content and text prompts. Accepts arbitrary sequences of im...

🖼️ → 📝 • image-to-text • visual-understanding • document-understanding • 8.3K runs

chenxwh/cogvlm2

Caption images and answer visual questions from an image plus an optional text prompt, returning text. Handle OCR-style...

🖼️ → 📝 • image-to-text • ocr • visual-question-answering • 6.6K runs