lucataco/florence-2-large 🖼️❓📝 → ❓

▶️ 465.0K runs 📅 Jun 2024 ⚙️ Cog 0.9.9
image-object-detection image-to-text ocr

About

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Example Output

Output

{"img":"https://replicate.delivery/pbxt/OSFBuet9KTQOF6BpGa2zlkjuYgcyVHNLB0v9wB4bjlielOCTA/output.png","text":"{'': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [456.0, 97.68000030517578, 580.1599731445312, 261.8399963378906], [450.8800048828125, 276.7200012207031, 554.5599975585938, 370.79998779296875], [95.68000030517578, 280.55999755859375, 198.72000122070312, 371.2799987792969]], 'labels': ['car', 'door', 'wheel', 'wheel']}}"}
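The `text` field in the example above is a string holding a Python-style dict (single quotes), so `json.loads` would reject it. The following is a minimal sketch of one way to unpack it, using `ast.literal_eval`. The helper name `parse_detections` is hypothetical, the sample string is a shortened copy of the example output, and reading each box as `[x1, y1, x2, y2]` pixel coordinates is an assumption not stated on this page.

```python
import ast


def parse_detections(text_field):
    """Turn the `text` string into a list of (label, box) pairs.

    Hypothetical helper: assumes every value in the outer dict has
    parallel 'bboxes' and 'labels' lists, as in the example output.
    """
    # The string is a Python literal, not JSON (note the single quotes).
    parsed = ast.literal_eval(text_field)
    detections = []
    for result in parsed.values():  # the example uses '' as its only key
        for label, box in zip(result["labels"], result["bboxes"]):
            detections.append((label, box))
    return detections


# Shortened copy of the example output: first two boxes and labels only.
sample_text = (
    "{'': {'bboxes': [[34.23999786376953, 160.0800018310547, "
    "597.4400024414062, 371.7599792480469], [456.0, 97.68000030517578, "
    "580.1599731445312, 261.8399963378906]], 'labels': ['car', 'door']}}"
)

detections = parse_detections(sample_text)
for label, box in detections:
    # Assumed box layout: [x1, y1, x2, y2]
    print(label, [round(v, 1) for v in box])
```

`ast.literal_eval` only evaluates literals, which makes it a safer choice than `eval` for strings returned by a remote service.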

Performance Metrics

2.25s Prediction Time
2.29s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/L9zDhV2KiVnudUyRiNjt9P18LZ98Hrqq5GGdx9szmBCAyEhP/car.jpg",
  "task_input": "Object Detection"
}
Input Parameters
image (required) Type: string
Grayscale input image
task_input Default: Caption
Input task
text_input Type: string
Text input (optional)
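As a rough sketch of how these parameters could be assembled and sent with the `replicate` Python client: `build_input` and `run_prediction` are hypothetical helper names, the model reference combines the model name with the version ID listed under Version Details, and the call assumes the `replicate` package is installed and `REPLICATE_API_TOKEN` is set. The network call is wrapped in a function and not executed here.

```python
MODEL_REF = (
    "lucataco/florence-2-large:"
    "da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595"
)


def build_input(image, task_input="Caption", text_input=None):
    """Assemble the input dict from the three documented parameters."""
    if not image:
        raise ValueError("image is required")
    payload = {"image": image, "task_input": task_input}
    if text_input is not None:  # optional, so omit it when unused
        payload["text_input"] = text_input
    return payload


def run_prediction(payload):
    """Send the payload to Replicate (needs network access and an API token)."""
    import replicate  # third-party client, imported lazily

    return replicate.run(MODEL_REF, input=payload)


# Same values as the example under "All Input Parameters".
payload = build_input(
    "https://replicate.delivery/pbxt/L9zDhV2KiVnudUyRiNjt9P18LZ98Hrqq5GGdx9szmBCAyEhP/car.jpg",
    task_input="Object Detection",
)
print(payload)
```

Going by the output schema on this page, the result of `run_prediction(payload)` would be expected to carry an `img` URI and a `text` string.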
Output Schema
img Type: string Format: uri
Img
text Type: string
Text
Version Details
Version ID
da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
Version Created
June 25, 2024