lucataco/florence-2-base 🖼️❓📝 → ❓

▶️ 125.2K runs 📅 Jun 2024 ⚙️ Cog 0.9.9 🔗 GitHub 📄 Paper ⚖️ License
image-object-detection image-to-text ocr

About

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Example Output

Output

{"text":"{'<CAPTION>': 'A green car parked in front of a yellow building.'}"}

Performance Metrics

0.82s Prediction Time
69.52s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/L9z39PBucXIWQM8fgd1M5XQdiGDWpD07EUdMuncsCVim9YQb/car.jpg",
  "task_input": "Caption"
}
Input Parameters
image (required) Type: string
Grayscale input image
task_input Default: Caption
Input task
text_input Type: string
Text Input(Optional)
Output Schema
img Type: stringFormat: uri
Img
text Type: string
Text
Version Details
Version ID
c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
Version Created
June 25, 2024
Run on Replicate →