image-object-detection AI Models - Page 2

bencevans/megadetector-v5a

Detect animals, people, and vehicles in images for camera trap monitoring. Takes an image as input and returns object de...

🖼️ • image-object-detection • camera-trap • wildlife-monitoring • 719 runs

🤖 Model 🖼️

jweek/mask_maker

Detect and segment specified objects in an input image and return RLE-encoded masks as JSON. Accepts an image and a comm...

🖼️ • image-segmentation • image-object-detection • 497 runs

🤖 Model

fire/v-sekai.mediapipe-labeler

Label facial blendshapes and pose keypoints from images or videos. Detect up to 1–100 people and return per-frame JSON a...

pose-estimation • face-detection • facial-blendshapes • 218 runs

🤖 Model 🖼️

hautechai/yolo11x-pose-image

Estimate human poses in images and output an image annotated with keypoints and skeletons. Uses Ultralytics YOLOv11 Pose...

🖼️ • pose-estimation • image-object-detection • 72 runs

🤖 Model 🖼️

sljeff/dots.ocr

Extract structured document layout and text from an image input and return a single JSON output. Parse page elements wit...

🖼️ • ocr • document-to-json • image-object-detection • 4.8K runs

🤖 Model 🖼️

eiby777/manga_globes

Detect and classify speech bubbles in manga images. Takes an image and returns bounding boxes, labels, and confidence sc...

🖼️ • image-object-detection • manga • 7 runs

🤖 Model 🖼️ → 📝

ghostljj/deepseek-ocr

Extract text and convert documents to markdown format from images using optical character recognition. Supports multiple...

🖼️ → 📝 • ocr • pdf-to-markdown • document-to-json • 92 runs

🤖 Model 🖼️

meepo-pro-player/winter-wyvern

Segment objects in images into labeled masks. Accepts an image input and returns semantic segments as per-object masks w...

🖼️ • image-segmentation • image-object-detection • 279.5K runs

🤖 Model 🖼️

davidgillsjo/srw-net

Detect semantic room wireframes from a single indoor image. Return line segments and junction coordinates with semantic...

🖼️ • image-object-detection • room-layout-estimation • wireframe-detection • 3.3K runs

🤖 Model 🖼️ → 📝

perceptron-ai-inc/isaac-0.1

Analyzes images and answers questions about visual content with spatially-aware responses. Takes an image and a text pro...

🖼️ → 📝 • image-to-text • visual-understanding • ocr • 39.1K runs

🤖 Model 🖼️

hyuse202/sef

Detect thoracic abnormalities in chest X-ray images and return bounding boxes with labels and confidence scores. Accepts...

🖼️ • image-object-detection • medical-imaging • x-ray • 2.6K runs

🤖 Model 🖼️

zacharylazzara/tent-detector

Detect tents in satellite or aerial images. Takes a single image as input and outputs an annotated image overlay highlig...

🖼️ • image-object-detection • remote-sensing • satellite-imagery • 36 runs

🤖 Model 🖼️

bencevans/megadetector-v4.1

Detect animals, humans, and vehicles in camera-trap images. Accepts an image and returns an annotated image with boundin...

🖼️ • image-object-detection • camera-trap • 569 runs

🤖 Model 🖼️

hardikdava/rf-detr

Detect objects in images and return bounding boxes, class labels, and confidence scores. Leverage RF-DETR (Roboflow Dete...

🖼️ • image-object-detection • 79 runs

🤖 Model 🖼️

hilongjw/sec_detect

Detect objects in an input image and output an annotated image with bounding boxes, class labels, and optional confidenc...

🖼️ • image-object-detection • 28 runs

🤖 Model 🖼️

meta/detic

Detect objects in images using open-vocabulary class names. Accepts an image plus a vocabulary preset (LVIS, Objects365,...

🖼️ • image-object-detection • open-vocabulary-detection • 28.2K runs

🤖 Model 🖼️

cudanexus/detic

Detect objects in images with open-vocabulary, zero-shot labels and return an annotated image and JSON detections. Accep...

🖼️ • image-object-detection • open-vocabulary • 294 runs

🤖 Model 🖼️

jigsawstack/object-detection

Detect objects in images and optionally return an annotated image. Accepts an image input and optional text prompts to t...

🖼️ • image-object-detection • 17 runs

🤖 Model 📝 → 🖼️

qwen/qwen-image

Generates images from text prompts with exceptional text rendering capabilities, particularly excelling at complex multi...

📝 → 🖼️ • text-to-image • image-to-image • image-editing • 1.8M runs

🤖 Model 🖼️ → 📝

remodela-ai/recognize-anything

Recognizes and identifies objects, text, and other elements in images, returning structured information about detected i...

🖼️ → 📝 • image-to-text • image-object-detection • ocr • 234 runs