Akabot Vision
Akabot Vision is the intelligent document processing (IDP) module that reads, classifies, and extracts structured data from unstructured sources — scanned invoices, contracts, forms, emails, and images. It combines OCR, machine learning, and configurable extraction templates to turn paper-based and digital documents into actionable data for your workflows.

Key features
- OCR engine — High-accuracy text extraction from scanned PDFs, images (PNG, JPEG, TIFF), and digital documents.
- Document classification — Automatically route incoming documents to the correct extraction template based on content.
- Field extraction — Define extraction rules with a visual template editor; no ML expertise required.
- Table detection — Identify and extract multi-row tables from invoices, purchase orders, and reports.
- Confidence scores — Every extracted field carries a confidence score so low-confidence items can be routed for human review.
- Human-in-the-loop — Built-in validation station for operators to review and correct uncertain extractions before data enters downstream systems.
Supported document types
- Invoices and purchase orders
- Identity documents (passports, national IDs)
- Contracts and legal agreements
- Bank statements and financial reports
- Medical records and forms
- Custom document types via template training
Getting started
- Upload sample documents to the Vision portal.
- Create a document type and label fields using the template editor.
- Train the model and validate extraction accuracy.
- Use the Vision activities in Studio to send documents and retrieve extracted data.
Next steps
- Activities reference — Vision activity catalog.
- Data Service — Persist extracted data for downstream processes.