document-to-json AI Models

bytedance/dolphin

Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...

pdf-to-markdown • document-to-json • ocr • 968 runs

🤖 Model

cuuupid/marker

Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...

pdf-to-markdown • ocr • 2.8K runs

🤖 Model

willywongi/donut

Extract structured data from receipt images into JSON. Input a receipt image; output structured key–value fields and lin...

document-to-json • ocr • receipt-parsing • 2.2K runs

🤖 Model 📝 → 📝

kshitijagrwl/pii-extractor-llm

Extracts personally identifiable information (PII) from text input, specifically trained to recognize Indian names and a...

📝 → 📝 • text-generation • data-analysis • document-processing • 169 runs

🤖 Model 🖼️

sljeff/dots.ocr

Extract structured document layout and text from an image input and return a single JSON output. Parse page elements wit...

🖼️ • ocr • document-to-json • image-object-detection • 4.8K runs

🤖 Model

jigsawstack/ai-scrape

Extract structured data from web pages into JSON. Accept a URL or raw HTML plus natural-language element prompts, scope...

document-to-json • web-scraping • 23 runs

🤖 Model

rubenamtz/law2entity

Extract entities and relationships from Spanish legal text into structured JSON for knowledge graph construction and leg...

document-to-json • entity-extraction • legal-nlp • 5 runs

🤖 Model

lucataco/olmocr-7b

Extract text and document metadata from PDF pages, returning structured JSON. Accepts a PDF and page number, then transc...

ocr • document-to-json • 4.0K runs

🤖 Model 🖼️ → 📝

jigsawstack/vocr

Extract text and structured data from images and multi-page PDFs using visual OCR and layout analysis. Accept an image o...

🖼️ → 📝 • ocr • document-to-json • image-to-text • 20 runs

🤖 Model

creatorrr/genre

Link entities in text to Wikipedia pages. Accepts text and returns disambiguated entity mentions with start/end characte...

entity-linking • wikification • named-entity-disambiguation • 44 runs

🤖 Model

sulthonmb/ocr-receipt

Extract structured purchase data from receipt images as JSON. Input a receipt image and output JSON with line items, qua...

ocr • document-to-json • receipt-parsing • 675 runs

🤖 Model

datalab-to/marker

Convert documents to Markdown and structured JSON. Accept PDF, DOC/DOCX, PPT/PPTX, and image files (PNG/JPG/WEBP) as inp...

pdf-to-markdown • document-to-json • ocr • 361 runs

🤖 Model

rocketcoder/florence-2-lg-ocr

Extracts text from images using Microsoft's Florence-2-Large model, optimized for high-throughput document OCR processin...

ocr • 81 runs

🤖 Model 🖼️ → 📝

openai/gpt-5.4

Generate text from prompts with configurable reasoning effort and verbosity for complex professional work, coding, and m...

🖼️ → 📝 • text-generation • code-generation • image-to-text • 121.5K runs

🤖 Model 🖼️ → 📝

openai/gpt-5-structured

Generate text content with structured outputs, web search capabilities, and custom tools based on text prompts and image...

🖼️ → 📝 • text-generation • image-to-text • document-to-json • 538.1K runs

🤖 Model

turian/arxiv-llm-text

Converts arXiv papers into a single, expanded LaTeX file for processing by Large Language Models. Takes an arXiv URL as...

document-to-json • pdf-to-markdown • 21 runs

🤖 Model 🖼️ → 📝

ghostljj/deepseek-ocr

Extract text and convert documents to markdown format from images using optical character recognition. Supports multiple...

🖼️ → 📝 • ocr • pdf-to-markdown • document-to-json • 92 runs

🤖 Model

lucataco/deepseek-ocr

Converts images containing documents, PDFs, charts, and handwritten text into structured markdown while preserving forma...

ocr • pdf-to-markdown • document-to-json • 93.6K runs

🤖 Model 🖼️ → 📝

nomagick/qwen-vl-chat

Generates text responses based on text prompts and images with ChatML prompt interface and streaming support. Accepts up...

🖼️ → 📝 • text-generation • image-to-text • image-analysis • 1.1K runs

🤖 Model 🖼️ → 📝

nvidia/nemotron-nano-v2-12b-vl

Analyzes images and videos to answer questions, extract data, and provide detailed descriptions. Supports processing up...

🖼️ → 📝 • image-to-text • video-to-text • document-to-json • 988 runs