
bytedance/dolphin
Parse PDFs and document images into structured Markdown or JSON. Perform page-level layout analysis to recover reading o...
Found 11 models (showing 1-11)
Parse PDFs and document images into structured Markdown or JSON. Perform page-level layout analysis to recover reading o...
Convert PDFs and ebooks to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2, including scanned documents; performs fast O...
Extract structured data from receipt images into JSON. Input a receipt image; output structured key-value fields and lin...
Extract personally identifiable information (PII) from text and return structured JSON. Detects Indian names, father nam...
Extract document layout and text from an image into structured JSON. Accepts a scanned page or document image and return...
Extract structured data from web pages into JSON. Accept a URL or raw HTML, plus element_prompts (comma-separated target...
Extract entities and relationships from Spanish legal text and return structured JSON for knowledge graph construction....
Extract text and layout metadata from PDF pages. Takes a PDF and page number as input and returns a structured JSON stri...
Extract text and structured data from images and multi-page PDFs. Provide an image or PDF plus a prompt string for scene...
Link entities in text to Wikipedia pages. Accepts one or more text strings and returns machine-readable annotations: det...
Extract structured receipt data from an image and return JSON. Takes a receipt photo or scan as input and outputs a JSON...