jigsawstack/vocr 📝❓🔢 → 📝

▶️ 22 runs 📅 Jun 2025 ⚙️ Cog 0.15.8

document-to-json image-to-text ocr

Performance

17.6s Typical run time

22 Total runs

About

Recognise, describe and retrieve data within an image with great accuracy.

Example Output

Prompt:

"describe the image in detail"

Output

/tmp/tmp1l_d4mvw/vocr_result.json

Performance Metrics

17.57s Prediction Time

17.57s Total Time

All Input Parameters

{
  "url": "https://media.snopes.com/2021/08/239918331_10228097135359041_3825446756894757753_n.jpg",
  "prompt": "describe the image in detail",
  "api_key": "<YOUR_JIGSAWSTACK_API_KEY>"
}

Input Parameters

url (Type: string): The image URL. Not required if file_store_key is specified.
prompt (Default: "Describe the image in detail."): The prompt used to describe the image. Can be a string or an array of strings for specific data extraction.
api_key (Type: string): 🔐 Your JigsawStack API key (required).
page_range (Type: array): The range of pages to process with VOCR, given as a two-element array: [startPage, endPage]. For example, [1, 10] processes pages 1 through 10. The range must span 10 or fewer pages.
file_store_key (Type: string): The key under which the image is stored in JigsawStack File Storage. Not required if url is specified.
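
The constraints in the parameter list above (an API key is required, at least one of url or file_store_key must be given, and page_range may cover at most 10 pages) can be checked before a run is submitted. The following is a minimal sketch, not part of the model itself: the helper name build_vocr_input is hypothetical, and it assumes that page numbers start at 1 and that the 10-page limit counts both endpoints, as the [1, 10] example suggests.

```python
from typing import Optional, Sequence, Union

MAX_PAGE_SPAN = 10  # from the page_range description: 10 or fewer pages
DEFAULT_PROMPT = "Describe the image in detail."


def build_vocr_input(
    api_key: str,
    url: Optional[str] = None,
    file_store_key: Optional[str] = None,
    prompt: Union[str, Sequence[str]] = DEFAULT_PROMPT,
    page_range: Optional[Sequence[int]] = None,
) -> dict:
    """Hypothetical helper: validate the inputs and return the input dict."""
    if not api_key:
        raise ValueError("api_key is required")
    if not url and not file_store_key:
        raise ValueError("provide either url or file_store_key")

    # prompt may be a single string or a list of strings
    payload = {
        "api_key": api_key,
        "prompt": prompt if isinstance(prompt, str) else list(prompt),
    }
    if url:
        payload["url"] = url
    if file_store_key:
        payload["file_store_key"] = file_store_key

    if page_range is not None:
        if len(page_range) != 2:
            raise ValueError("page_range must be [startPage, endPage]")
        start, end = page_range
        # Assumption: pages are 1-indexed and the range is inclusive.
        if start < 1 or end < start:
            raise ValueError("page_range must satisfy 1 <= startPage <= endPage")
        if end - start + 1 > MAX_PAGE_SPAN:
            raise ValueError("page_range must span 10 or fewer pages")
        payload["page_range"] = [start, end]

    return payload
```

The resulting dict has the same shape as the "All Input Parameters" example. With the Replicate Python client it could presumably be passed as the input argument of replicate.run, using the model name and version ID listed on this page.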

Output Schema

Output

Type: string

Version Details

Version ID: 89fee97b435eed5a19f9e1a7279e48da91fe639483d82c941327b52972ce0503
Version Created: June 27, 2025
