lucataco/florence-2-large 🖼️❓📝 → ❓
About
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Example Output
Output
{"img":"https://replicate.delivery/pbxt/OSFBuet9KTQOF6BpGa2zlkjuYgcyVHNLB0v9wB4bjlielOCTA/output.png","text":"{'': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [456.0, 97.68000030517578, 580.1599731445312, 261.8399963378906], [450.8800048828125, 276.7200012207031, 554.5599975585938, 370.79998779296875], [95.68000030517578, 280.55999755859375, 198.72000122070312, 371.2799987792969]], 'labels': ['car', 'door', 'wheel', 'wheel']}}"}
Performance Metrics
2.25s
Prediction Time
2.29s
Total Time
All Input Parameters
{ "image": "https://replicate.delivery/pbxt/L9zDhV2KiVnudUyRiNjt9P18LZ98Hrqq5GGdx9szmBCAyEhP/car.jpg", "task_input": "Object Detection" }
Input Parameters
- image (required)
- Grayscale input image
- task_input
- Input task
- text_input
- Text Input(Optional)
Output Schema
- img
- Img
- text
- Text
Version Details
- Version ID
da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
- Version Created
- June 25, 2024