bytedance/dolphin
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Found 11 models (showing 1-11)
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...
Convert academic PDF documents into Markdown text. Accept a PDF and return extracted text with document structure (headi...
Convert PDFs, Office files, images, audio, HTML, and structured data to Markdown for LLM ingestion, indexing, and analys...
Convert PDFs to Markdown with optional OCR for scanned documents. Accepts a PDF and a method setting (auto, txt, or ocr)...
Convert images or PDFs containing mathematical notation into Markdown/LaTeX text. Accept an image input and return a tex...
Convert documents to Markdown and structured JSON. Accept PDF, DOC/DOCX, PPT/PPTX, and image files (PNG/JPG/WEBP) as inp...
Extract text and structure from academic PDFs using OCR. Accepts a PDF file and outputs a machine-readable transcription...
Converts arXiv papers into a single, expanded LaTeX file for processing by Large Language Models. Takes an arXiv URL as...
Extract text and convert documents to markdown format from images using optical character recognition. Supports multiple...
Converts images containing documents, PDFs, charts, and handwritten text into structured markdown while preserving forma...