Tag: ocr

Filtered selection of tools tagged ocr.

ABBYY Vantage

ABBYY Vantage is an enterprise platform for intelligent document processing where OCR, classification, extraction, and human review work together.

Automation Custom quote

AWS Textract

AWS Textract is a cloud service for extracting text, tables, form fields, and structured data from documents within AWS architectures.

Developer Usage-based

Klippa

Klippa provides OCR and document processing for invoices, receipts, and other business documents, often used in API-driven finance workflows.

Automation Plan-based

Mistral OCR

Mistral OCR is a document AI capability for developers who want to feed OCR results into LLM and agent workflows.

Developer Usage-based

Rossum

Rossum is a document AI platform for teams that need to extract and validate structured data from recurring business documents such as invoices, purchase orders, and delivery notes.

Automation Custom quote

Veryfi

Veryfi is an API-first service for extracting receipt, invoice, and accounting data, aimed at cases where structured output matters more than plain-text OCR.

Developer Usage-based

Azure AI Document Intelligence

Azure AI Document Intelligence is Microsoft's service for OCR, form analysis, and structured document extraction in Azure and Microsoft 365-adjacent architectures.

Developer Usage-based

Google Document AI

Google Document AI combines OCR, specialized document processors, and structured extraction for teams processing document data in Google Cloud workflows.

Developer Usage-based

Mindee

Mindee is an API-oriented OCR and document AI service that helps developers extract structured fields from invoices, receipts, and other document types.

Developer Usage-based

Nanonets

Nanonets combines OCR, field extraction, and workflow automation so documents can be recognized, reviewed, and routed onward.

Automation Plan-based

OCRmyPDF

OCRmyPDF adds a searchable text layer to scanned PDFs and is especially useful as a preprocessing step in local document pipelines.

Developer Open Source

PaddleOCR

PaddleOCR is an open-source OCR toolkit for developers who want more control over recognition, layout analysis, and custom document pipelines.

Developer Open Source

Tesseract OCR

Tesseract OCR is an open-source OCR engine for local text recognition and remains an important building block when privacy, control, or cost argue against cloud OCR.

Developer Open Source