bencevans/megadetector-v5a
Detect animals, people, and vehicles in images for camera trap monitoring. Takes an image as input and returns object de...
Detect and segment specified objects in an input image and return RLE-encoded masks as JSON. Accepts an image and a comm...
Label facial blendshapes and pose keypoints from images or videos. Detect up to 100 people and return per-frame JSON a...
Estimate human poses in images and output an image annotated with keypoints and skeletons. Uses Ultralytics YOLOv11 Pose...
Extract structured document layout and text from an image input and return a single JSON output. Parse page elements wit...
Detect and classify speech bubbles in manga images. Takes an image and returns bounding boxes, labels, and confidence sc...
Extract text and document structure from images into plain text or Markdown. Accept an image and a task type (markdown,...
Segment objects in images into labeled masks. Accepts an image input and returns semantic segments as per-object masks w...
Detect semantic room wireframes from a single indoor image. Return line segments and junction coordinates with semantic...
Answer questions about images with grounded visual references. Takes an image and a natural-language prompt and returns...
Detect thoracic abnormalities in chest X-ray images and return bounding boxes with labels and confidence scores. Accepts...
Detect tents in satellite or aerial images. Takes a single image as input and outputs an annotated image overlay highlig...
Detect animals, humans, and vehicles in camera-trap images. Accepts an image and returns an annotated image with boundin...
Detect objects in images and return bounding boxes, class labels, and confidence scores. Leverage RF-DETR (Roboflow Dete...
Detect objects in an input image and output an annotated image with bounding boxes, class labels, and optional confidenc...
Detect objects in images using open-vocabulary class names. Accepts an image plus a vocabulary preset (LVIS, Objects365,...
Detect objects in images with open-vocabulary, zero-shot labels and return an annotated image and JSON detections. Accep...