UiPath Documentation
document-understanding
2022.4
true
  • Overview
    • Introduction
    • Language Support
    • AI Center Relation to Document Understanding
  • Document Understanding Process
    • Document Understanding Process: Studio Template
  • Quickstart Tutorials
    • Extracting Data From Receipts
    • Invoices Retrained With One Additional Field
    • Extracting Data From Forms
  • Framework Components
    • Taxonomy
      • Taxonomy Overview
      • Taxonomy Manager
      • Taxonomy Related Activities
    • Digitization
      • Digitization Overview
      • OCR Engines
      • Digitization Related Activities
    • Document Classification
      • Document Classification Overview
      • Configure Classifiers Wizard of Classify Document Scope
      • Keyword Based Classifier
        • Special Requirements
      • Intelligent Keyword Classifier
        • Special Requirements
      • FlexiCapture Classifier
      • Machine Learning Classifier
      • Document Classification Related Activities
    • Document Classification Validation
      • Document Classification Validation Overview
      • Classification Station
      • Document Classification Validation Related Activities
    • Document Classification Training
      • Document Classification Training Overview
      • Configure Classifiers Wizard of Train Classifiers Scope
      • Machine Learning Classifier Trainer
      • Document Classification Training Related Activities
    • Data Extraction
      • Data Extraction Overview
      • Configure Extractors Wizard of Data Extraction Scope
      • RegEx Based Extractor
        • Special Requirements
      • Form Extractor
        • Special Requirements
      • Intelligent Form Extractor
        • Special Requirements
      • Machine Learning Extractor
      • FlexiCapture Extractor
      • Data Extraction Related Activities
    • Data Extraction Validation
      • Data Extraction Validation Overview
      • Validation Station
      • Data Extraction Validation Related Activities
    • Data Extraction Training
      • Data Extraction Training Overview
      • Configure Extractors Wizard of Train Extractors Scope
      • Machine Learning Extractor Trainer
      • Data Extraction Training Related Activities
    • Data Consumption
      • Data Consumption Overview
      • Data Consumption Related Activities
  • ML Packages
    • About ML Packages
    • Hardware Requirements
    • Supported languages
      • OCR
      • ML Packages
      • Other services
    • OCR Configuration
  • Pipelines
    • About Pipelines
    • Training Pipelines
    • Evaluation Pipelines
    • Full Pipelines
    • Fine-tuning
    • The Auto-Fine-tuning Loop (Public Preview)
  • Document Manager
    • About Document Manager
    • Access Document Manager
    • Use a Predefined Schema
    • Create and Configure Fields
    • Import Documents
    • Label Documents
    • Search Documents
    • Export Documents
    • Checkboxes and Signatures
  • OCR Services
    • OCR Services
  • Document Understanding deployed in Automation Suite
    • Install and Use
    • First Run Experience
    • Deploy UiPathDocumentOCR
    • Deploy an Out-of-the-box ML Package
    • ML Packages Offline Installation
      • Offline bundles 2022.4.15
      • Offline Bundles 2022.4.14
      • Offline Bundles 2022.4.13
      • Offline Bundles 2022.4.12
      • Offline Bundles 2022.4.11
      • Offline Bundles 2022.4.9
      • Offline Bundles 2022.4.7
      • Offline Bundles 2022.4.5
      • Offline Bundles 2022.4.0
    • Use Document Manager
    • Use the Framework
  • Document Understanding deployed in AI Center standalone
    • Install and Use
      • Hardware Requirements
      • SQL Server Requirements
    • First Run Experience
      • Activate the License
      • Create a Project on AI Center
      • Upload the Document Understanding ML Packages
      • Create a Data Labeling Session
      • Launch the Data Labeling Session
    • Deploy UiPathDocumentOCR
    • Deploy an Out-of-the-box ML Package
      • Create an Invoices ML Package
      • Deploy the Invoices ML Package as an ML Skill
    • ML Packages Offline Installation
    • Use Document Manager
    • Use the Framework
      • Use Document Understanding Models (including UiPathDocumentOCR)
  • Deep Learning
    • Training High Performing Models
      • Data Extraction Components
  • Licensing
    • Public Endpoints
    • API Key
    • Cloud and On-Prem Usage
      • Machine Learning Extractor
    • Metering & Charging Logic
    • Legal Information
  • References
    • Activities Packages
      • UiPath.Abbyy.Activities
      • UiPath.AbbyyEmbedded.Activities
      • UiPath.DocumentUnderstanding.ML.Activities
      • UiPath.DocumentUnderstanding.OCR.LocalServer.Activities
      • UiPath.IntelligentOCR.Activities
      • UiPath.OCR.Activities
      • UiPath.OCR.Contracts
      • UiPath.DocumentProcessing.Contracts
      • UiPath.OmniPage.Activities
      • UiPath.PDF.Activities
UiPath logo, featuring letters U and I in white

Document Understanding User Guide

Last updated Apr 4, 2025

Extracting Data From Receipts

Important:

The aim of this page is to help first time users get familiar with Document Understanding.

For scalable production deployments, we strongly recommend using the Document Understanding Process available in UiPath Studio under the Templates section.

This quickstart shows you how to extract data from receipts using the out-of-the-box Receipts ML model with its corresponding public endpoint.

Validation can be done either by presenting the Validation Station or by using the Validation Action in Action Center. Both options are described in the following sections.

Using Receipts ML Model With Public Endpoint and Validation Station

In this section, we are going to validate the extraction results using Validation Station.

To create a basic workflow using the Receipts ML Model follow the steps below.

  1. Create a blank process
  2. Install the required activities packages
  3. Create a taxonomy
  4. Digitize the document
  5. Extract the data using the Receipts ML model
  6. Validate the results using Validation Station
  7. Export the extraction results

Now, let us see every step in detail.

1. Create a Blank Process

Launch UiPath Studio.

In the HOME backstage view, click Process to create a new project.

The New Blank Process window is displayed. In this window, enter a name for the new project. If you want, you can also add a description to sort through your projects more easily.

Click Create. The new project is opened in Studio.

2. Install the Required Activities Packages

From the Manage Packages button in the ribbon, besides the core activities packages (UiPath.Excel.Activities,UiPath.Mail.Activities,UiPath.System.Activities,UiPath.UIAutomation.Activities) that are added to the project by default, install the following activities packages:

3. Create a Taxonomy

Once the activity packages are installed, list out the required fields. The Receipts ML model supports data extraction for the fields below:

  • name - Text
  • vendor-addr - Address
  • total - Number
  • date - Date
  • phone - Text
  • currency - Text
  • expense-type - Text
  • items - Table
    • description - Text
    • line-amount - Number
    • unit-price - Number
    • quantity - Number

Open Taxonomy Manager and create a group named "Semi Structured Documents", a category named "Finance", and a document type named "Receipts". Create the above listed fields with user-friendly names along with respective data types.



4. Digitize the Document

In the Main.xaml file, add a Load Taxonomy activity and create a variable for the taxonomy output.

Add a Digitize Document activity with UiPath Document OCR. Provide the input property Document Path and create output variables for Document Text and Document Object Model.

Remember to add the Document Understanding API Key in the UiPath Document OCR activity.

5. Extract the Data Using the Receipts ML Model

Add a Data Extraction Scope activity and fill in the properties.

Drag and drop a Machine Learning Extractor activity. A pop-up with three input parameters, Endpoint, ML Skill, and ApiKey, is displayed on the screen.

Fill in the Endpoint parameter with the Receipts Public Endpoint, namely https://du.uipath.com/ie/receipts, and provide the Document Understanding API key.

Click on Get Capabilities.

The next step is to configure the extractor. Configuring the extractor means mapping the fields that you created in Taxonomy Manager to the fields available in the ML model like shown in the below image:



To use the Machine Learning Extractor with an ML Skill, choose the ML Skill from the dropdown and configure the extractor.

You must have your robot assistant connected to the same tenant as your ML Skill.



6. Validate the Results Using Validation Station

To check the results through Validation Station, drag and drop the Present Validation Station activity and provide the input details.



7. Export the Extraction Results

To export the extraction results, drag and drop an Export Extraction Results activity to the end of your workflow. This outputs the results into a DataSet that contains multiple tables, which could then be written to an Excel file or be used directly in a downstream process.


Download Example

Download this sample project using this link.

The example contains two workflows:

  • Main.xaml - in this workflow, the extraction results are validated using Validation Station; this is described in the above section
  • Main - Unattended.xaml - in this workflow, the extraction results are validated using Validation Action; this is described in the following section

Using Receipts ML Model With Public Endpoint and Validation Action

Now, let’s see how to use an Action Center Validation Action instead of presenting the Validation Station.

How Do Tasks in Action Center Work?

When an automation includes decisions that a human should make, such as approvals, escalations, and exceptions, UiPath Action Center makes it easy and efficient to hand off the process from robot to human. And back again.

Document Understanding Action Center activities come with the UiPath.IntelligentOCR.Activities package and the UiPath.Persistance.Activities package. Don’t forget to enable Persistence activities from the General Settings in UiPath Studio:



How Does the Validation Action Work?

Productivity can be increased by adding an orchestration process that adds document validation actions in Action Center, in both on-premises Orchestrator and Automation Cloud. This action reduces the need for storing the documents locally, having a robot installed on each human's operated machine, or having the robot wait for human users to finish validation.

More details here.

How to Use the Validation Action?

Repeat steps 1 to 5 described in the above section.

Then, instead of using the Present Validation Station activity, use the Create Document Validation Action and Wait for Document Validation Action and Resume activities.

The below image shows the Create Document Validation Action activity and its properties.



This creates a document validation action in Action Center. The output of the Create Document Validation Action activity can then be used with the Wait for Document Validation Action and Resume activity to suspend and resume orchestration workflows upon human action completion in Action Center.

Was this page helpful?

Connect

Need help? Support

Want to learn? UiPath Academy

Have questions? UiPath Forum

Stay updated