bytedance/dolphin
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Found 13 models (showing 1-13)
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...
Extract structured data from receipt images into JSON. Input a receipt image; output structured keyβvalue fields and lin...
Extract PII from text into structured JSON. Accepts a text string and returns fields such as name, fatherName, dob, and...
Extract structured document layout and text from an image input and return a single JSON output. Parse page elements wit...
Extract structured data from web pages into JSON. Accept a URL or raw HTML plus natural-language element prompts, scope...
Extract entities and relationships from Spanish legal text into structured JSON for knowledge graph construction and leg...
Extract text and document metadata from PDF pages, returning structured JSON. Accepts a PDF and page number, then transc...
Extract text and structured data from images and multi-page PDFs using visual OCR and layout analysis. Accept an image o...
Link entities in text to Wikipedia pages. Accepts text and returns disambiguated entity mentions with start/end characte...
Extract structured purchase data from receipt images as JSON. Input a receipt image and output JSON with line items, qua...
Convert documents to Markdown and structured JSON. Accept PDF, DOC/DOCX, PPT/PPTX, and image files (PNG/JPG/WEBP) as inp...
Perform batch OCR on document images and return structured JSON. Input a ZIP of images (JPG, PNG, WEBP) and choose betwe...