
abiruyt/text-extract-ocr
Extract text from images with OCR. Takes a single image as input and outputs the detected text as a plain string for cop...
Found 49 models (showing 1-20)
Extract text from images with OCR. Takes a single image as input and outputs the detected text as a plain string for cop...
Parse PDFs and document images into structured Markdown or JSON. Perform page-level layout analysis to recover reading o...
Caption images and long videos and answer visual questions, returning text. Accepts an image or video plus an instructio...
Analyze images in conversational chat to answer questions, caption scenes, and localize objects with bounding boxes. Acc...
Extract text and answer questions about document images. Accepts an image and a text prompt and outputs text for OCR, ta...
Answer questions about images, perform OCR, and caption visual content. Takes an image and a text prompt and outputs tex...
Answer questions about images from an image input and a text prompt, returning text. Support visual question answering (...
Answer questions about images and produce text output. Handle image captioning, visual question answering (VQA), optical...
Answer questions about images and videos. Accepts an image or a video plus a question and returns text, enabling visual...
Answer questions about images and return text. Accepts an image and a natural-language question, and outputs textual res...
Caption images and answer visual questions from an image and a text prompt, returning text. Add visual understanding to...
Answer questions about an image from a text prompt and return text. Perform visual question answering, image captioning,...
Convert academic PDFs to text using OCR. Accepts a PDF document as input and returns formatted plain text suitable for s...
Convert PDFs and ebooks to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2, including scanned documents; performs fast O...
Extract text from images and PDFs with multilingual OCR. Run line-level text detection or full OCR on selected pages and...
Extract text from academic PDF documents. Accepts a PDF input and returns a plain-text transcription using neural OCR op...
Convert images of mathematical equations into LaTeX code. Accepts an image input and returns a LaTeX string, performing...
Extract structured data from receipt images into JSON. Input a receipt image; output structured keyβvalue fields and lin...
Extract structured information and answer questions from documents, charts, tables, diagrams, and general images. Accept...
Answer questions about images. Takes an image and a natural-language question as input and returns a text answer. Suppor...