datalab-to/ocr 🖼️🔢✓📝 → ❓

⭐ Official ▶️ 28.6K runs 📅 Oct 2025 ⚙️ Cog 0.16.8

About

Detect and transcribe text in images with accurate bounding boxes, layout analysis, reading order, and table recognition, in 90 languages.

Example Output

Output

{"text":"One Ring to rule them all
One Ring to find them
One Ring to bring them all
and in the darkness bind them
In the Land of Mordor
where the Shadows lie","pages":null,"page_count":null,"visualizations":["https://replicate.delivery/xezq/94Y888Ag936zHJVPRelwGT1eIKoAviU6w9TOSrF6BiVkmvfqA/visualization.jpg"]}
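
A response of this shape can be unpacked with the standard library alone. The sketch below is illustrative rather than official client code: `parse_ocr_output` is a hypothetical helper, the sample string is a shortened stand-in with a placeholder URL, and it assumes the response arrives as raw JSON text with literal line breaks inside the `text` value, as displayed above.

```python
import json

# Shortened stand-in for the response above: same keys, placeholder URL.
raw = '''{"text":"One Ring to rule them all
One Ring to find them","pages":null,"page_count":null,"visualizations":["https://example.com/visualization.jpg"]}'''


def parse_ocr_output(raw_json: str) -> dict:
    """Split the transcribed text into lines and normalize null fields."""
    # strict=False lets json.loads accept literal newlines inside strings.
    data = json.loads(raw_json, strict=False)
    return {
        "lines": (data.get("text") or "").splitlines(),
        "pages": data.get("pages") or [],  # null unless return_pages is true
        "visualizations": data.get("visualizations") or [],
    }


result = parse_ocr_output(raw)
print(result["lines"])
```

If a client library has already decoded the response into a dictionary, the `json.loads` step can be dropped and the same normalization applied directly.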

Performance Metrics

5.44s Prediction Time

5.45s Total Time

All Input Parameters

{
  "file": "https://collections-zoo-output.replicate.dev/predictions/78j9fndvp1rmc0csx5gbhrqgpg/1760557063643-jaj2s8zkj6.jpg",
  "visualize": true,
  "skip_cache": false,
  "return_pages": false
}
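
As a rough sketch of how such an input might be assembled and submitted from Python: `build_ocr_input` and `run_ocr` are hypothetical helpers, and the call assumes the third-party `replicate` client is installed and that a `REPLICATE_API_TOKEN` is set in the environment. Only the parameter names come from this page.

```python
def build_ocr_input(file_url: str, visualize: bool = False,
                    skip_cache: bool = False, return_pages: bool = False) -> dict:
    """Assemble the input dict for one prediction (hypothetical helper)."""
    if not file_url:
        raise ValueError("file is required")
    return {
        "file": file_url,
        "visualize": visualize,
        "skip_cache": skip_cache,
        "return_pages": return_pages,
    }


def run_ocr(payload: dict):
    """Submit the payload; requires network access and an API token."""
    # Imported lazily so the payload builder works without the client installed.
    import replicate
    return replicate.run("datalab-to/ocr", input=payload)


payload = build_ocr_input("https://example.com/page.jpg", visualize=True)
print(payload)
```

Keeping payload construction separate from the network call makes the input easy to inspect or log before any prediction is billed.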

Input Parameters

file (required) Type: string: Input file. Must be one of: .pdf, .doc, .docx, .ppt, .pptx, .png, .jpg, .jpeg, .webp
max_pages Type: integer, Range: 1 - ∞: Maximum number of pages to process. Cannot be specified if page_range is set - these parameters are mutually exclusive
visualize Type: boolean, Default: false: Draw red polygons on the input image(s) to visualize detected text regions and return the annotated images
page_range Type: string: Page range to parse, comma separated like 0,5-10,20. Example: '0,2-4' will process pages 0, 2, 3, and 4. Cannot be specified if max_pages is set - these parameters are mutually exclusive
skip_cache Type: boolean, Default: false: Bypass the server-side cache and force re-processing. By default, identical requests are cached to save time and cost. Enable this to get fresh results
return_pages Type: boolean, Default: false: Return detailed page information including text lines, bounding boxes, polygons, and character-level data. When disabled, only text and page_count will be returned
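
The page_range syntax and the max_pages / page_range exclusivity rule can be checked on the client before a request is sent. The following is a minimal sketch, not the service's own parser: `parse_page_range` and `check_page_options` are hypothetical helpers, and how the server treats duplicates, ordering, or whitespace is an assumption here.

```python
from typing import List, Optional


def parse_page_range(spec: str) -> List[int]:
    """Expand a spec such as '0,2-4' into [0, 2, 3, 4] (ranges are inclusive)."""
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            raise ValueError(f"empty entry in page range: {spec!r}")
        if "-" in part:
            start_s, end_s = part.split("-", 1)
            start, end = int(start_s), int(end_s)
            if start > end:
                raise ValueError(f"descending range: {part!r}")
            pages.update(range(start, end + 1))
        else:
            pages.add(int(part))
    return sorted(pages)


def check_page_options(max_pages: Optional[int] = None,
                       page_range: Optional[str] = None) -> None:
    """Reject combinations the parameter descriptions rule out."""
    if max_pages is not None and page_range is not None:
        raise ValueError("max_pages and page_range are mutually exclusive")
    if max_pages is not None and max_pages < 1:
        raise ValueError("max_pages must be at least 1")
    if page_range is not None:
        parse_page_range(page_range)  # raises ValueError on malformed specs


print(parse_page_range("0,2-4"))
```

Validating locally gives an immediate error message instead of a failed prediction.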

Output Schema

Example Execution Logs

Processing OCR with request ID: dogWFkJYbhvM-6FamjsYDA
OCR processed in 5.2sec
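
For monitoring, the request ID and processing time can be pulled out of log lines like these with two regular expressions. This sketch assumes the log format shown above stays stable, which the page does not guarantee; `parse_ocr_log` is a hypothetical helper.

```python
import re

LOG = """Processing OCR with request ID: dogWFkJYbhvM-6FamjsYDA
OCR processed in 5.2sec"""


def parse_ocr_log(log: str) -> dict:
    """Extract the request ID and processing time in seconds, if present."""
    request_id = re.search(r"request ID:\s*(\S+)", log)
    seconds = re.search(r"processed in\s*([0-9]+(?:\.[0-9]+)?)\s*sec", log)
    return {
        "request_id": request_id.group(1) if request_id else None,
        "seconds": float(seconds.group(1)) if seconds else None,
    }


info = parse_ocr_log(LOG)
print(info)
```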

Version Details

Version ID: 3e6db0d5311d6fdc232eea333c1e26055ba4e542180043f12acb2967e5c77f4a
Version Created: October 20, 2025
