adirik/grounding-dino 🖼️📝🔢✓ → ❓

▶️ 28.7M runs 📅 Oct 2023 ⚙️ Cog 0.8.3 🔗 GitHub 📄 Paper ⚖️ License

image-object-detection open-vocabulary open-vocabulary-detection text-guided-detection

About

Detect everything with language!

Example Output

Output

{"detections":[{"bbox":[19,204,408,563],"label":"pink mug","confidence":0.8077122569084167},{"bbox":[545,263,952,650],"label":"pink mug","confidence":0.7644544839859009},{"bbox":[416,60,764,380],"label":"pink mug","confidence":0.4754282832145691},{"bbox":[909,161,1078,487],"label":"pink mug","confidence":0.43150201439857483}],"result_image":"https://pbxt.replicate.delivery/oJDgostFveUPEiUVnTRM9MnL7rxEMXIuy65E6K2X638PfFyRA/result.png"}

Performance Metrics

1.68s Prediction Time

1.67s Total Time

All Input Parameters

{
  "image": "https://replicate.delivery/pbxt/JlgUQIQCDemKg7bnfn5zKMqLgAPrZdpMMHzkXgHX5HUlbw9z/mugs.webp",
  "query": "pink mug",
  "box_threshold": 0.2,
  "text_threshold": 0.2,
  "show_visualisation": true
}

Input Parameters

image Type: string: Input image to query
query Type: string: Comma seperated names of the objects to be detected in the image
box_threshold Type: numberDefault: 0.25Range: 0 - 1: Confidence level for object detection
text_threshold Type: numberDefault: 0.25Range: 0 - 1: Confidence level for object detection
show_visualisation Type: booleanDefault: true: Draw and visualize bounding boxes on the image

Output Schema

Output

Version Details

Version ID: efd10a8ddc57ea28773327e881ce95e20cc1d734c589f7dd01d2036921ed78aa
Version Created: October 23, 2023

Run on Replicate →