abiruyt/text-extract-ocr
Extract text from images. Accepts a single image and returns the recognized text as plain text, supporting digitization...
Found 62 models (showing 1-20)
Extract text from images. Accepts a single image and returns the recognized text as plain text, supporting digitization...
Convert PDFs or document images into Markdown or structured JSON with layout-aware OCR and element parsing. Perform page...
Generate captions, answers, and summaries for images and videos. Accepts an image or video plus a text prompt and output...
Answer questions about images, caption scenes, and localize entities with bounding boxes. Accept a ChatML-formatted prom...
Extract content and answer questions from images of documents. Takes an image plus a text prompt or question and outputs...
Answer questions about images and extract text from images. Takes an image and a text prompt and returns a text response...
Answer questions about images. Takes an image and a text prompt, and returns a text answer, enabling visual question ans...
Analyze images and answer questions from an image plus text prompt, returning text. Handle visual question answering (VQ...
Answer questions about images and videos, perform OCR, and describe scenes, returning text. Accepts an image or a video...
Answer questions about images and return text. Accepts an image and a natural-language question and outputs a textual an...
Answer questions and caption images from an input image and text prompt. Accept an image plus a natural-language query a...
Answer questions about images from an image and text prompt, returning text. Perform visual question answering, image ca...
Extract text and structure from academic PDFs using OCR. Accepts a PDF file and outputs a machine-readable transcription...
Convert scanned or digital documents to Markdown. Accepts PDF, EPUB, MOBI, XPS, and FB2 inputs and outputs Markdown plus...
Extract text from images and PDFs in 90+ languages. Accept an image or multi-page PDF, a selected language list, and a p...
Convert academic PDF documents into Markdown text. Accept a PDF and return extracted text with document structure (headi...
Extract LaTeX code from images of mathematical equations and expressions. Takes a single image as input and returns the...
Extract structured data from receipt images into JSON. Input a receipt image; output structured keyβvalue fields and lin...
Analyze documents and images from one or more image inputs plus a text prompt, returning text captions, OCR, and answers...
Answer questions about images. Takes an image and a natural-language question and returns text, enabling visual question...