
bytedance/dolphin
Found 6 models (showing 1-6)
Parse PDFs and document images into structured Markdown or JSON. Perform page-level layout analysis to recover reading o...
Convert PDFs and ebooks to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2, including scanned documents; performs fast O...
Extract text from academic PDF documents. Accepts a PDF input and returns a plain-text transcription using neural OCR op...
Convert PDFs, Office documents, images, audio, HTML, and other files to clean Markdown text. Accepts PDF, DOCX, PPTX, XL...
Convert PDFs to Markdown text. Accepts a PDF input and extracts content using the embedded text layer (txt) or optical c...
Extract LaTeX-formatted math from images or PDFs and return Markdown text. Takes an image (or PDF) containing equations...