UiPath Documentation
document-understanding
latest
false
重要 :
新发布内容的本地化可能需要 1-2 周的时间才能完成。
UiPath logo, featuring letters U and I in white

Document Understanding classic user guide

上次更新日期 2026年4月23日

从收据中提取数据

重要提示:

The aim of this page is to help first time users get familiar with UiPath® Document UnderstandingTM. For scalable production deployments, we strongly recommend using the Document Understanding Process available in UiPath® Studio under the Templates section.

This quickstart shows you how to extract data from receipts using the out-of-the-box Receipts ML model with its corresponding public endpoint.

可以通过提供“验证站点”或使用 Action Center 中的“验证操作”来完成验证。以下各部分将介绍这两个选项。

将收据 ML 模型与公共端点和验证站点一起使用

在本节中,我们将使用“验证站点”来验证提取结果。

To create a basic workflow using the Receipts ML Model, use the following steps:

  1. 创建空白流程
  2. 安装所需的活动包
  3. 创建分类
  4. 将文档数字化
  5. 使用收据 ML 模型提取数据
  6. 使用验证站点验证结果
  7. 导出提取结果

1. 创建空白流程

  • 启动 UiPath Studio。
  • In the HOME backstage view, select Process to create a new project.
  • 系统将显示“新的空白流程”窗口。在此窗口中,输入新项目的名称。如果需要,您还可以添加说明,以便更轻松地对项目进行排序。
  • Select Create. The new project is opened in Studio.

2. 安装所需的活动包

From the Manage Packages button in the ribbon, besides the core activities packages (UiPath.Excel.Activities, UiPath.Mail.Activities, UiPath.System.Activities, UiPath.UIAutomation.Activities) that are added to the project by default, install the following activities packages:

3. 创建分类

Once the activities packages are installed, list out the required fields. The Receipts ML model supports data extraction for the following fields:

  • 名称 - Text
  • 供应商地址 - Address
  • 总计 - Number
  • 日期 - Date
  • 电话号码 - Text
  • 货币 - Text
  • 费用类型 - Text
  • 项目 - Table
    • 说明 - Text
    • 行数量 - Number
    • 单价 - Number
    • 数量 - Number

Open Taxonomy Manager and create a group named Semi Structured Documents, a category named Finance, and a document type named Receipts. Create the listed fields with user friendly names along with respective data types.

分类管理器的屏幕截图

4. 将文档数字化

  • In the Main.xaml file, add a Load Taxonomy activity and create a variable for the taxonomy output.
  • Add a Digitize Document activity with UiPath Document OCR. Provide the input property Document Path and create output variables for Document Text and Document Object Model.
  • 请记住在“UiPath 文档 OCR”活动中添加 Document Understanding API 密钥。

5. 使用收据 ML 模型提取数据

  • Add a Data Extraction Scope activity and fill in the properties.

  • Drag and drop a Machine Learning Extractor activity. A pop-up with three input parameters, Endpoint, ML Skill, and ApiKey, is displayed on the screen.

  • Fill in the Endpoint parameter with the Receipts Public Endpoint, namely https://du.uipath.com/ie/receipts, and provide the Document Understanding API key.

  • Select Get Capabilities.

  • The next step is to configure the extractor. Configuring the extractor means mapping the fields that you created in Taxonomy Manager to the fields available in the ML model.

    “配置提取程序”对话框的屏幕截图。

  • 要将机器学习提取程序与 ML 技能一起使用,请从下拉列表中选择 ML 技能并配置提取程序。

  • 您必须将机器人 Assistant 连接到与 ML 技能所在的同一租户。

    “机器学习提取程序”对话框中“ML 技能”下拉列表的屏幕截图。

6. 使用验证站点验证结果

To check the results through Validation Station, drag and drop the Present Validation Station activity and provide the input details.

“显示验证站点”活动的屏幕截图。

7. 导出提取结果

To export the extraction results, drag and drop an Export Extraction Results activity to the end of your workflow. This outputs the results into a DataSet that contains multiple tables, which could then be written to an Excel file or be used directly in a downstream process.

描述“导出提取结果”活动的屏幕截图。

下载示例

Download this sample project using this link.

该示例包含两个工作流:

  • Main.xaml - in this workflow, the extraction results are validated using Validation Station
  • Main - Unattended.xaml - 在此工作流中,使用“验证操作”验证提取结果;下一节将对此进行说明

将收据 ML 模型与公共端点和验证操作一起使用

请参阅此页面,详细了解如何使用 Action Center 验证操作,而非显示验证站点。

Action Center 中的任务如何工作?

当自动化包括应人工做出的决策(例如批准、升级和异常处理)时,UiPath Action Center 可以轻松高效地将流程从机器人移交给人工。然后再次返回。

Document Understanding Action Center 活动随附 UiPath.IntelligentOCR.Activities 包和 UiPath.Persistance.Activities 包。请勿忘记从 UiPath Studio 的“常规设置”中启用“暂留”活动:

描述“项目设置”界面的屏幕截图,突出显示“支持暂留”切换按钮。

验证操作如何工作?

通过在内部部署 Orchestrator 和 Automation Cloud 中创建可在 Action Center 中添加文档验证操作的编排流程来提高生产力。此操作可减少在本地存储文档、在每个人类操作的计算机上安装一个机器人或让机器人等待人类用户完成验证的需求。

More details here.

如何使用验证操作?

重复上一部分中所述的第 1 步至第 5 步。

Then, instead of using the Present Validation Station activity, use the Create Document Validation Action and Wait for Document Validation Action and Resume activities.

“创建文档验证操作”活动的屏幕截图。

This creates a document validation action in Action Center. The output of the Create Document Validation Action activity can then be used with the Wait for Document Validation Action and Resume activity to suspend and resume orchestration workflows upon human action completion in Action Center.

此页面有帮助吗?

连接

需要帮助? 支持

想要了解详细内容? UiPath Academy

有问题? UiPath 论坛

保持更新