abiruyt/text-extract-ocr
Extract text from images. Accepts a single image and returns the recognized text as plain text, supporting digitization...
Found 65 models (showing 1-20)
Extract text from images. Accepts a single image and returns the recognized text as plain text, supporting digitization...
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Generate captions, answers, and summaries for images and videos. Accepts an image or video plus a text prompt and output...
Answer questions about images, caption scenes, and localize entities with bounding boxes. Accept a ChatML-formatted prom...
Extract content and answer questions from images of documents. Takes an image plus a text prompt or question and outputs...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns a text response...
Answer questions about images. Takes an image and a text prompt, and returns a text answer, enabling visual question ans...
Analyzes images and answers questions about visual content using a Mixture-of-Experts architecture. Takes an image and t...
Answer questions about images and videos, perform OCR, and describe scenes, returning text. Accepts an image or a video...
Answers questions about images through multimodal understanding. Takes an image and a text question as input and generat...
Analyzes images and generates text descriptions or answers questions about visual content. Uses a projection module trai...
Answer questions about images from an image and text prompt, returning text. Perform visual question answering, image ca...
Extract text and structure from academic PDFs using OCR. Accepts a PDF file and outputs a machine-readable transcription...
Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...
Extract text from images and PDFs in 90+ languages. Accept an image or multi-page PDF, a selected language list, and a p...
Convert academic PDF documents into Markdown text. Accept a PDF and return extracted text with document structure (headi...
Extract LaTeX code from images of mathematical equations and expressions. Takes a single image as input and returns the...
Extract structured data from receipt images into JSON. Input a receipt image; output structured keyβvalue fields and lin...
Analyze documents and images from one or more image inputs plus a text prompt, returning text captions, OCR, and answers...
Analyzes images and answers questions about them using a unified autoregressive framework for multimodal understanding....