bencevans/megadetector-v5a
Detect animals, people, and vehicles in images. Takes a single image as input and outputs structured object detections w...
Found 31 models (showing 21-31)
Detect animals, people, and vehicles in images. Takes a single image as input and outputs structured object detections w...
Detect and segment specified objects in an input image and return RLE-encoded masks as JSON. Accepts an image and a comm...
Label facial blendshapes and pose keypoints from images or videos. Detect up to 1–100 people and return per-frame JSON a...
Estimate human poses in images and output an image annotated with keypoints and skeletons. Uses Ultralytics YOLOv11 Pose...
Extract structured document layout and text from an image input and return a single JSON output. Parse page elements wit...
Detect and classify speech bubbles in manga images. Takes an image and returns bounding boxes, labels, and confidence sc...
Extract text and document structure from images into plain text or Markdown. Accept an image and a task type (markdown,...
Segment objects in images into labeled masks. Accepts an image input and returns semantic segments as per-object masks w...
Detect semantic room wireframes from a single indoor image. Return line segments and junction coordinates with semantic...
Answer questions about images with grounded visual references. Takes an image and a natural-language prompt and returns...
Detect thoracic abnormalities in chest X-ray images and return bounding boxes with labels and confidence scores. Accepts...