Tag: data-extraction
Filtered selection of tools tagged data-extraction.
AWS Textract
AWS Textract is a cloud service for extracting text, tables, form fields, and structured document data inside AWS architectures.
Klippa
Klippa provides OCR and document processing for invoices, receipts, and other business documents, often used in API-driven finance workflows.
Mistral OCR
Mistral OCR is a document AI capability for developers who want to feed OCR results into LLM and agent workflows.
Docparser
Docparser extracts structured data from recurring PDFs and documents when layouts are stable enough for rules, zones, or parser logic to work reliably.
Google Document AI
Google Document AI combines OCR, specialized document processors, and structured extraction for teams processing document data in Google Cloud workflows.
Mindee
Mindee is an API-oriented OCR and document AI service that helps developers extract structured fields from invoices, receipts, and other document types.
Parseur
Parseur is a parser for emails, PDFs, and attachments that can send document data into spreadsheets, webhooks, or automation tools.