- 概述
- 文档处理合同
- 发行说明
- 关于文档处理合同
- Box 类
- IPersistedActivity 接口
- PrettyBoxConverter 类
- IClassifierActivity 接口
- IClassifierCapabilitiesProvider 接口
- 分类器文档类型类
- 分类器结果类
- 分类器代码活动类
- 分类器原生活动类
- 分类器异步代码活动类
- 分类器文档类型功能类
- ContentValidationData Class
- EvaluatedBusinessRulesForFieldValue Class
- EvaluatedBusinessRuleDetails Class
- 提取程序异步代码活动类
- 提取程序代码活动类
- 提取程序文档类型类
- 提取程序文档类型功能类
- 提取程序字段功能类
- 提取程序原生活动类
- 提取程序结果类
- FieldValue Class
- FieldValueResult Class
- ICapabilitiesProvider 接口
- IExtractorActivity 接口
- 提取程序有效负载类
- 文档操作优先级枚举
- 文档操作数据类
- 文档操作状态枚举
- 文档操作类型枚举
- 文档分类操作数据类
- 文档验证操作数据类
- 用户数据类
- 文档类
- 文档拆分结果类
- DomExtensions 类
- 页类
- 页面分区类
- 多边形类
- 多边形转换器类
- 元数据类
- 词组类
- 词类
- 处理源枚举
- 结果表格单元类
- 结果表值类
- 结果表列信息类
- 结果表类
- 旋转枚举
- Rule Class
- RuleResult Class
- RuleSet Class
- RuleSetResult Class
- 分区类型枚举
- 词组类型枚举
- IDocumentTextProjection 接口
- 分类结果类
- 提取结果类
- 结果文档类
- 结果文档范围类
- 结果数据点类
- 结果值类
- 结果内容引用类
- 结果值令牌类
- 结果派生字段类
- 结果数据源枚举
- 结果常量类
- 简单字段值类
- 表字段值类
- 文档组类
- 文档分类类
- 文档类型类
- 字段类
- 字段类型枚举
- FieldValueDetails Class
- 语言信息类
- 元数据输入类
- 文本类型枚举
- 类型字段类
- ITrackingActivity 接口
- ITrainableActivity 接口
- ITrainableClassifierActivity 接口
- ITrainableExtractorActivity 接口
- 可训练的分类器异步代码活动类
- 可训练的分类器代码活动类
- 可训练的分类器原生活动类
- 可训练的提取程序异步代码活动类
- 可训练的提取程序代码活动类
- 可训练的提取程序原生活动类
- 基本数据点类 - 预览
- 提取结果处理程序类 - 预览
- Document Understanding ML
- Document Understanding OCR 本地服务器
- Document Understanding
- 智能 OCR
- 发行说明
- 关于“智能 OCR”活动包
- 项目兼容性
- 加载分类
- 将文档数字化
- 分类文档作用域
- 基于关键词的分类器
- Document Understanding 项目分类器
- 智能关键词分类器
- 创建文档分类操作
- 创建文档验证工件
- 检索文档验证工件
- 等待文档分类操作然后继续
- 训练分类器范围
- 基于关键词的分类训练器
- 智能关键词分类训练器
- 数据提取作用域
- Document Understanding 项目提取程序
- Document Understanding 项目提取程序训练器
- 基于正则表达式的提取程序
- 表单提取程序
- 智能表单提取程序
- 文档脱敏
- 创建文档验证操作
- 等待文档验证操作然后继续
- 训练提取程序范围
- 导出提取结果
- 机器学习提取程序
- 机器学习提取程序训练器
- 机器学习分类器
- 机器学习分类训练器
- 生成分类器
- 生成式提取程序
- 配置身份验证
- ML 服务
- OCR
- OCR 合同
- OmniPage
- PDF
- [未公开] Abbyy
- [未列出] Abbyy 嵌入式

Document Understanding 活动
关于“智能 OCR”活动包
UiPath™.IntelligentOCR.Activities 包含使用完整、开放、可扩展的方法启用文档处理流程的基础架构。
When installing the UiPath.IntelligentOCR.Activities package, the UiPath.DocumentUnderstanding.ML.Activities package is automatically installed to perform ML-based classification and extraction.
已知限制
如果您使用的是暂停工作流的“等待”类型活动,并且您正在使用DataTable变量,则DataTable变量必须可序列化。例如,如果使用new System.Data.DataTable初始化DataTable变量,则它将变为不可序列化,这可能导致项目执行失败。为防止出现这种情况,您可以将DataTable变量的默认值留空,或为DataTable变量分配一个名称,例如: new System.Data.DataTable("MyTable") 。
替换已移除的版本
下表显示了已删除的包版本,以及建议使用的版本。
Table 1. The removed versions and their recommended replacements
| 推荐版本 | |
|---|---|
| 4.3.0-preview | 4.4.0-preview |
| 2.1.0 | 2.2.0 |
| 1.4.0 | 1.5.0 |
| 1.2.0 | 1.2.1 |
重要
- Starting with the v6.19.0 release, when installing the UiPath.IntelligentOCR.Activities package in a project, the UiPath.DocumentUnderstanding.ML.Activities package is automatically installed as well and you do not need to install it separately.
- If you are using UiPath® Studio 2023.4.4 or earlier, make sure to install the latest version of Windows .NET 6.0 Desktop Runtime.
版本兼容性
Updating the UiPath.IntelligentOCR.Activities also requires an update for the UiPath.UIAutomation.Activities package and for UiPath.OCR.Activities package if included in the project.
UiPath.IntelligentOCR.Activities and UiPath.DocumentUnderstanding.Activities should not be used together in the same project. The UiPath.IntelligentOCR.Activities package should be used for Windows (or Legacy) workflows, while the UiPath.DocumentUnderstanding.Activities package should be used for Cross-platform workflows.
Before using the Monitor feature, take into consideration the following:
- Only UiPath.DocumentUnderstanding.Activities v 2.7.0 or newer versions support Monitor;
- Only IntelligentOCR.Activities v6.20 or newer versions include the Document Understanding Project Extractor activity which supports Monitor
支持的格式
“智能 OCR”活动包可以支持以下任何文件类型:.png, .gif, .jpe, .jpg, .jpeg, .tiff, .tif, .bmp 和 .pdf。
支持 C# 项目语言
从版本 4.10.0 开始, 此活动包已经过验证,可在 C# 项目中使用。
功能
本节显示 Intelligent.OCR 包的多种功能。
将文档数字化
You can achieve this using the Digitize Document activity. This retrieves the text from any PDF or image, using, only if necessary, the OCR engine of your choice.
在逐个处理文档时,它们会经历数字化流程。 非数字(扫描)文档的区别在于,您需要应用所选的 OCR 引擎。 此步骤的输出是文档对象模型和包含所有文档文本的字符串变量,并将传递到后续步骤。
分类文档
You can achieve this using the Classify Document activity. This allows identifying what type of document a file is by using any classification algorithm.
数字化后,文档将进行分类。 如果您在同一个项目中使用多种文档类型,则要正确提取数据,您需要知道正在使用的文档类型。 重要的是,您可以在同一作用域中使用多个分类器,可以配置分类器,并稍后在框架中训练它们。 分类结果有助于在提取中应用正确的策略。
以下列表显示了可用的分类器:
- The Keyword Based Classifier activity is the first such classifier, targeting classification for titled documents.
- The Intelligent Keyword Classifier activity can not only classify but also "split" files that contain multiple document types within them.
- The Machine Learning Classifier activity can classify your files using a powerful ML Model, that you can train according to your needs.
- The Generative Classifier activity allows you to classify documents using generative models.
验证自动分类
You can achieve this using the Present Classification Station attended activity, which presents a document processing specific user interface for validating and correcting automatic classification outputs.
特别是对于涉及文件拆分的用例,强烈建议使用人工分类验证步骤,以确保用于数据提取的下游处理正常工作。
An alternative to the attended activity is available through the usage of Long-Running Workflows, which are designed to optimally enable human-robot collaboration. The Create Document Classification Action and the Wait For Document Classification Action And Resume activities enable this scenario.
训练分类器
You can achieve this using the Train Classifiers Scope activity. This empowers the closing of the feedback loop to any classification algorithm capable of learning. Drag and drop your classifier trainers within this Scope activity and enable them using the Configure Classifiers wizard to ensure that the information validated by humans through the Classification Station or Validation Station is used by your classifiers to improve their own performance.
分类与所使用的分类器一样有效。 如果文档未正确分类,则表示活动分类器未知该文档。 该框架提供了训练分类器的机会,以提高对文档类的识别。
以下是可用的分类器训练器列表:
- The Keyword Based Classifier Trainer is the trainer activity paired with the Keyword Based Classifier.
- The Intelligent Keyword Classifier Trainer enables the feedback loop for Intelligent Keyword Classifier.
- The Machine Learning Classifier Trainer is the trainer activity paired with the Machine Learning Classifier.
从文档中提取数据
You can achieve this using the Data Extraction Scope activity. This allows the usage of any data extraction algorithm for identifying different fields in a classified document.
提取只是从给定的文档类型中获取您感兴趣的数据。 例如,如果要使用字符串操作来从 5 页文档中提取特定数据,则非常麻烦。 在此框架中,您可以在同一数据提取作用域内针对不同的文档结构使用不同的提取程序。 提取结果将进一步传递以进行验证。
以下是可用提取程序的列表:
- The RegEx Based Extractor is a basic data extractor that applies regular expression matching to identify the best candidates for a specific field.
- The Form Extractor uses predefined templates to enable the processing of structured, fixed form documents.
- The Machine Learning Extractor leverages the power of AI and Machine Learning to identify information in structured or semi-structured documents by either using one of UiPath®'s public data extraction services or by calling custom trained Machine Learning models that you can build and host in AI Center.
- The Generative Extractor allows you to extract documents using generative models.
验证自动数据提取结果
You can achieve this using the Present Validation Station attended activity, which presents a document processing specific user interface for data validation and correction.
- The extracted data can be validated by a human user through the Validation Station. A best practice is to build logic around the decision of adding or not a human validation step, with rules depending on the specific use case to be implemented. Validation results can then be exported and used in further automation activities.
- You can also enable human validation through long-running workflows, optimizing human-robot collaboration, using the Create Document Validation Action and Wait for Document Validation Action and Resume activities.
火车提取器
You can achieve this using the Train Extractors Scope activity. This empowers the closing of the feedback loop to any data extraction algorithm capable of learning. Drag and drop your extractor trainers within this Scope activity and enable them using the Configure Extractors wizard to ensure that the information validated by humans through the Validation Station is used by your extractors to improve their own performance.
与所使用的提取器一样,提取效率很高。 如果未正确提取字段值,则表示活动的提取程序不知道这些字段值。 该框架提供了培训提取程序的机会,以提高对字段值的识别。
The Machine Learning Extractor Trainer closes the feedback loop for ML-based data extraction, by collecting the data required for retraining a Machine Learning model hosted in AI Center. This activity is the companion of the Machine Learning Extractor activity.
导出提取的信息
You can achieve this using the Export Extraction Results activity. This allows you to export the complex structure of extracted data to a simple DataSet (collection of DataTables).
获得经过验证的信息后,您可以按原样使用它,也可以将其保存为数据表格式,该格式可以很容易地转换为 Excel 文件。
The UiPath.IntelligentOCR.Activities package is compatible with any custom classification or data extraction activity that is built based on the public UiPath.DocumentProcessing.Contracts package. It offers full flexibility to build your own algorithm specific to your use case, as well as integrating it with any third-party solution for document classification and data extraction.
以下版本的包已从官方订阅源中删除。 如果您有任何问题,请联系我们的支持团队。