UiPath Documentation
document-understanding
latest
false
  • Overview
    • Introduction
    • Capabilities overview
    • AI Center relation to Document Understanding
  • Getting started
    • Introduction
    • Enabling Document Understanding™ and creating a project
    • Customer-Managed Keys
    • Role based access control in Document Understanding™
      • Access the role based access control
      • Manage access
        • Creating a custom role
    • Document types (Document Manager)
      • Access Document types
      • Use a predefined schema
      • Create and configure fields
      • Import documents
      • Label documents
      • Search documents
      • Export documents
      • Checkboxes and signatures
      • Dataset diagnostics
    • Forms AI
    • One Click Classification
    • One Click Extraction
  • Activities
    • Activities packages
      • UiPath.DocumentProcessing.Contracts
      • UiPath.DocumentUnderstanding.Activities
      • UiPath.DocumentUnderstanding.ML.Activities
      • UiPath.DocumentUnderstanding.OCR.LocalServer.Activities
      • UiPath.IntelligentOCR.Activities
      • UiPath.OCR.Activities
      • UiPath.OCR.Contracts
      • UiPath.OmniPage.Activities
      • UiPath.PDF.Activities
  • Insights dashboards
    • AI units consumption dashboards
      • [Preview] AI units consumption overview
      • IxP AI Unit consumption dashboard
      • FAQ and troubleshooting
  • Document Understanding Process
    • Document Understanding Process template
  • Quickstart tutorials
    • Extracting data from receipts
    • Invoices retrained with one additional field
    • Extracting data from Forms
    • Create a new automation starting from a file
  • Framework components
    • Overview
    • Document Understanding activities
    • Intelligent OCR activities
      • Taxonomy
        • Taxonomy overview
        • Taxonomy Manager
        • Taxonomy related activities
      • Digitization
        • Digitization overview
        • Digitization related activities
        • OCR engines
      • Document classification
        • Document classification overview
        • Configure Classifiers Wizard of Classify Document Scope
        • Intelligent Keyword Classifier
        • Keyword Based Classifier
        • Machine Learning Classifier
        • Generative Classifier
        • Document classification related activities
      • Document Classification Validation
        • Document classification validation overview
        • Classification Station
        • Document classification validation related activities
      • Document Classification Training
        • Configure Classifiers Wizard of Train Classifiers Scope
        • Document classification training overview
        • Document classification training related activities
        • Machine Learning Classifier Trainer
      • Data extraction
        • Configure Extractors Wizard of Data Extraction Scope
        • Data extraction overview
        • Data extraction related activities
        • Form Extractor
        • Intelligent Form Extractor
        • Machine Learning Extractor
        • RegEx Based Extractor
      • Data Extraction Validation
        • Data Extraction Validation overview
        • Data extraction validation related activities
        • Validation Station
      • Data Extraction Training
        • Configure Extractors Wizard of Train Extractors scope
        • Data Extraction Training overview
        • Data extraction training related activities
        • Machine Learning Extractor Trainer
      • Data consumption
    • API calls
  • ML packages
    • Overview
    • Document Understanding - ML package
    • DocumentClassifier - ML package
    • ML packages with OCR capabilities
    • Out-of-the-box pre-trained ML packages
      • 1040 - ML package
      • 1040 Schedule C - ML package
      • 1040 Schedule D - ML package
      • 1040 Schedule E - ML package
      • 1040x - ML package
      • 3949a - ML package
      • 4506T - ML package
      • 709 - ML package
      • 941x - ML package
      • 9465 - ML package
      • ACORD125 - ML package
      • ACORD126 - ML package
      • ACORD131 - ML package
      • ACORD140 - ML package
      • ACORD25 - ML package
      • Bank Statements - ML package
      • Bills Of Lading - ML package
      • Certificate of Incorporation - ML package
      • Certificate of Origin - ML package
      • Checks - ML package
      • Children Product Certificate - ML package
      • CMS 1500 - ML package
      • EU Declaration of Conformity - ML package
      • Financial Statements - ML package
      • FM1003 - ML package
      • I9 - ML package
      • ID Cards - ML package
      • Invoices - ML package
      • Invoices Australia - ML package
      • Invoices China - ML package
      • Invoices Hebrew - ML package
      • Invoices India - ML package
      • Invoices Japan - ML package
      • Invoices Shipping - ML package
      • Packing Lists - ML package
      • Payslips - ML package
      • Passports - ML package
      • Purchase Orders - ML package
      • Receipts - ML package
      • Remittance Advices - ML package
      • UB04 - ML package
      • Utility Bills - ML package
      • Vehicle Titles - ML package
      • W2 - ML package
      • W9 - ML package
    • Other Out-of-the-box ML Packages
    • Public endpoints
    • Traffic limitations
    • OCR Configuration
  • Pipelines
    • About pipelines
      • Terms and definitions
    • Training Pipelines
    • Evaluation Pipelines
    • Full Pipelines
    • Fine-tuning
    • The Auto-Fine-tuning Loop (Public Preview)
  • OCR services
    • OCR services
  • Supported languages
    • Overview
    • OCR
    • ML models and capabilities
  • Deep Learning
    • Training high performing models
    • Deploying high performing models
  • Data and security
    • Data residency
    • Data storage
    • Legal information
  • Licensing
    • API Key
    • Cloud and on-premises usage
    • Metering and charging logic (Unified Pricing)
    • Metering and charging logic (Flex Plan)
UiPath logo, featuring letters U and I in white

Document Understanding User Guide

Last updated Feb 9, 2026

OCR engines

An OCR Engine is used in the Digitization component, to identify text in a file, when native content is not available.

Note:

The images that need to be processed should have a resolution range of:

  • min: 50 x 50 pixels
  • max: 9000 x 9000 pixels

Here is a selection of OCR Engines that you can choose from, according to your needs, throughout the Document UnderstandingTM Framework.

OCR Engine

Activity Pack

Debug Logs Format in Logs Folder

Reports Confidence

UiPath Extended Languages OCRUiPath.OCR.Activities${date:format=yyyy-MM-dd}

UiPath.OCR.Activities

${date:format=yyyy-MM-dd}

OCR for Chinese, Japanese and KoreanUiPath.Core.Activities.CjkOCR${date:format=yyyy-MM-dd}

UiPath.OmniPage.Activities

${date:format=yyyy-MM-dd}

UiPath.UIAutomation.Activities

${date:format=yyyy-MM-dd}

No if DetectionMode is set to TextDetection (default)

Yes if DetectionMode is set to DocumentTextDetection

UiPath.UIAutomation.Activities

${date:format=yyyy-MM-dd}

No if UseReadAPI is not selected (default)

Yes if UseReadAPI is selected

UiPath.UIAutomation.Activities

${date:format=yyyy-MM-dd}

UiPath.UIAutomation.Activities

${date:format=yyyy-MM-dd}

Note: When debugging errors, you can always visit the logs folder and check the relevant OCR log files. Read more about logging here.

Was this page helpful?

Connect

Need help? Support

Want to learn? UiPath Academy

Have questions? UiPath Forum

Stay updated