lucataco/florence-2-large 🖼️❓📝 → ❓

▶️ 2.0M runs 📅 Jun 2024 ⚙️ Cog 0.9.9 🔗 GitHub 📄 Paper ⚖️ License

image-object-detection image-to-text ocr

Performance

2.2sTypical run time

2.0MTotal runs

About

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Example Output

Output

{"img":"https://replicate.delivery/pbxt/OSFBuet9KTQOF6BpGa2zlkjuYgcyVHNLB0v9wB4bjlielOCTA/output.png","text":"{'': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [456.0, 97.68000030517578, 580.1599731445312, 261.8399963378906], [450.8800048828125, 276.7200012207031, 554.5599975585938, 370.79998779296875], [95.68000030517578, 280.55999755859375, 198.72000122070312, 371.2799987792969]], 'labels': ['car', 'door', 'wheel', 'wheel']}}"}

Performance Metrics

2.25s Prediction Time

2.29s Total Time

All Input Parameters

{
  "image": "https://replicate.delivery/pbxt/L9zDhV2KiVnudUyRiNjt9P18LZ98Hrqq5GGdx9szmBCAyEhP/car.jpg",
  "task_input": "Object Detection"
}

Input Parameters

image (required) Type: string: Grayscale input image
task_input Default: Caption: Input task
text_input Type: string: Text Input(Optional)

Output Schema

img Type: stringFormat: uri: Img
text Type: string: Text

Version Details

Version ID: da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
Version Created: June 25, 2024

Run on Replicate →