bytedance/dolphin
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Found 8 models (showing 1-8)
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...
Convert academic PDF documents into Markdown text. Accept a PDF and return extracted text with document structure (headi...
Convert PDFs, Office files, images, audio, HTML, and structured data to Markdown for LLM ingestion, indexing, and analys...
Convert PDFs to Markdown with optional OCR for scanned documents. Accepts a PDF and a method setting (auto, txt, or ocr)...
Convert images or PDFs containing mathematical notation into Markdown/LaTeX text. Accept an image input and return a tex...
Convert documents to Markdown and structured JSON. Accept PDF, DOC/DOCX, PPT/PPTX, and image files (PNG/JPG/WEBP) as inp...
Extract text and structure from academic PDFs using OCR. Accepts a PDF file and outputs a machine-readable transcription...