- 概述
- 文档处理合同
- 发行说明
- 关于文档处理合同
- Box 类
- IPersistedActivity 接口
- PrettyBoxConverter 类
- IClassifierActivity 接口
- IClassifierCapabilitiesProvider 接口
- 分类器文档类型类
- 分类器结果类
- 分类器代码活动类
- 分类器原生活动类
- 分类器异步代码活动类
- 分类器文档类型功能类
- ContentValidationData Class
- EvaluatedBusinessRulesForFieldValue Class
- EvaluatedBusinessRuleDetails Class
- 提取程序异步代码活动类
- 提取程序代码活动类
- 提取程序文档类型类
- 提取程序文档类型功能类
- 提取程序字段功能类
- 提取程序原生活动类
- 提取程序结果类
- FieldValue Class
- FieldValueResult Class
- ICapabilitiesProvider 接口
- IExtractorActivity 接口
- 提取程序有效负载类
- 文档操作优先级枚举
- 文档操作数据类
- 文档操作状态枚举
- 文档操作类型枚举
- 文档分类操作数据类
- 文档验证操作数据类
- 用户数据类
- 文档类
- 文档拆分结果类
- DomExtensions 类
- 页类
- 页面分区类
- 多边形类
- 多边形转换器类
- 元数据类
- 词组类
- 词类
- 处理源枚举
- 结果表格单元类
- 结果表值类
- 结果表列信息类
- 结果表类
- 旋转枚举
- Rule Class
- RuleResult Class
- RuleSet Class
- RuleSetResult Class
- 分区类型枚举
- 词组类型枚举
- IDocumentTextProjection 接口
- 分类结果类
- 提取结果类
- 结果文档类
- 结果文档范围类
- 结果数据点类
- 结果值类
- 结果内容引用类
- 结果值令牌类
- 结果派生字段类
- 结果数据源枚举
- 结果常量类
- 简单字段值类
- 表字段值类
- 文档组类
- 文档分类类
- 文档类型类
- 字段类
- 字段类型枚举
- FieldValueDetails Class
- 语言信息类
- 元数据输入类
- 文本类型枚举
- 类型字段类
- ITrackingActivity 接口
- ITrainableActivity 接口
- ITrainableClassifierActivity 接口
- ITrainableExtractorActivity 接口
- 可训练的分类器异步代码活动类
- 可训练的分类器代码活动类
- 可训练的分类器原生活动类
- 可训练的提取程序异步代码活动类
- 可训练的提取程序代码活动类
- 可训练的提取程序原生活动类
- 基本数据点类 - 预览
- 提取结果处理程序类 - 预览
- Document Understanding ML
- Document Understanding OCR 本地服务器
- Document Understanding
- 智能 OCR
- 发行说明
- 关于“智能 OCR”活动包
- 项目兼容性
- 加载分类
- 将文档数字化
- 分类文档作用域
- 基于关键词的分类器
- Document Understanding 项目分类器
- 智能关键词分类器
- 创建文档分类操作
- 创建文档验证工件
- 检索文档验证工件
- 等待文档分类操作然后继续
- 训练分类器范围
- 基于关键词的分类训练器
- 智能关键词分类训练器
- 数据提取作用域
- Document Understanding 项目提取程序
- Document Understanding 项目提取程序训练器
- 基于正则表达式的提取程序
- 表单提取程序
- 智能表单提取程序
- 文档脱敏
- 创建文档验证操作
- 等待文档验证操作然后继续
- 训练提取程序范围
- 导出提取结果
- 机器学习提取程序
- 机器学习提取程序训练器
- 机器学习分类器
- 机器学习分类训练器
- 生成分类器
- 生成式提取程序
- 配置身份验证
- ML 服务
- OCR
- OCR 合同
- OmniPage
- PDF
- [未公开] Abbyy
- [未列出] Abbyy 嵌入式

Document Understanding 活动
分类文档
UiPath.IntelligentOCR.StudioWeb.Activities.ClassifyDocument
描述
您可以通过选择所需的分类器和一个要分类的文档,使用此活动对多个文档进行分类。
The Classify Document activity uses public endpoints.
The supported languages for the generative models are the same as the used OCR engine used. For more information, check the OCR Supported languages page.
Unless this activity is the first Document Understanding activity part of a Studio workflow, the input should be Document Data. File should only be used as input if the activity is the first Document Understanding one part of a Studio workflow.
已知限制
The Generative Predefined project type and the corresponding extractors are not available in Automation Suite.
项目兼容性
Windows | 跨平台
配置
设计器面板
- Input - Provide the input file or the Document Data object.
重要提示:
The maximum numbers of pages a file can have is 500. Files exceeding this limit fail to be classified.
提示:When your files aren't stored as an
IResourcetype variable, there's an option to perform a conversion. UseLocalResource.FromPath(<reference_to_the_file>)in the Input property field for this. Consider a scenario where you are iterating through a list of files using a For Each activity. SupposecurrentItemis your iterating variable. To convertcurrentItemintoIResource, pasteLocalResource.FromPath(currentItem)into the Input field. - Document Understanding project - Requires you to select your Document Understanding project from the drop-down menu. The available options are:
- Predefined - Project that uses pre-trained specialized models recommended for standard scenarios.
- Generative Predefined - Project that uses pre-trained generative models accepting instructions as input for classification or extraction of document data.
- 您连接的租户和文件夹中的现有项目
- You can create a new project by selecting the + icon.
备注:
If you have created more than 500 projects on your tenant and use the Classify Document activity, UiPath Studio or Studio Web will not display any projects beyond the initial 500. Therefore, those projects cannot be used.
- Classifier - If you are using the Predefined project, then you can select your desired Document Understanding classifier from the drop-down menu.
备注:
The data sent to the Generative Classifier will be sent to an LLM Model instance which is not publicly available, will not leave it, and once processed, it will not be stored or used for training.
- For the Predefined project you have two options:
- ML Classification – ML-based classifier.
- Generative Classifier – The generative classifier type.
- Document Type details - Instructions to identify Document Types, provided as key-value pairs, where the key represents the name of the Document Type and the value a description for it, helping the classifier identify such documents.
- Document Type - Provide the name of the document type to be used as classification result (30-character limit).
- Instruction - Requires you to provide instructions for the Generative Classifier on how to identify the document type. The maximum number of characters allowed is 1000.
- Document Type details - Instructions to identify Document Types, provided as key-value pairs, where the key represents the name of the Document Type and the value a description for it, helping the classifier identify such documents.
- For the Generative Predefined project you can only use the Generative Classifier.
- For the Predefined project you have two options:
- Version - Use this property when using an existing Document Understanding modern project. Select the tag that corresponds to the project version from which you want to process data. For instance, if you choose the Production tag assigned to Version 3, the activity processes data from Version 3 of your project in the production environment. The default value for Version is Staging. If the Staging tag doesn't exist in your selected project, then the default value is Production. After selecting a tag, the activity displays a list of supported document types for that version.
属性面板
高级选项
- Minimum confidence - Specify the minimum confidence threshold based on which a document type is assigned during classification. If a document's confidence score falls below this threshold, its Document Type is reported as "unknown".
提示:
Most document types generate a prediction with a confidence level. Setting this property prevents false positives by only considering the predictions with a confidence level above the threshold. You can identify an optimal confidence level by testing various documents within your workflow, recording the results in an Excel spreadsheet, for example, and then analyze what threshold value is the most accurate.
- Design-time external connectionThe design-time external connection allows you to leverage the activity using Document Understanding resources from other projects or tenants. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned in the Configuring runtime external connection page. Once these steps are completed, you can then proceed to configure the runtime external connection.
-
App ID: Enter the App ID of the external application you previously created.
-
App secret: Enter the App secret of the external application you previously created.
-
Tenant URL: Enter the URL of the tenant where you created the external application. This is the tenant from where you will use resources at design-time.
URL 应采用以下格式:
https://<baseURL>/<OrganizationName>/<TenantName>。
-
输入
- Timeout (seconds) - Maximum execution time (in seconds) for the call to the generative model. If the operation exceeds this timeout, it is automatically terminated to prevent delays or hangs. This property is only displayed if the Generative Classifier is selected as a classifier.
输出
- Document Data - All the validated extracted field data from the file.
运行时外部连接
The runtime external connection allows you to execute the activity via on-premises robots. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned in the Configuring runtime external connection page. Once these steps are completed, you can then proceed to configure the runtime external connection.
- Runtime Credentials Asset
- Use this field when you need to access Document Understanding resources while the robot is connected to a local Orchestrator, or from a different tenant. You can choose to enter a Credential Asset, for authentication purposes, in one of the following ways:
-
From the dropdown list, select the desired Credential Asset from the Orchestrator to which the UiPath® Robot is connected to.
-
如果您在 Orchestrator 凭据资产中存储了用于访问项目的外部应用程序凭据,请手动输入 Orchestrator 凭据资产的路径。
路径的格式应为:
<OrchestratorFolderName>/<AssetName>。
- Runtime Tenant Url - Use this field, alongside the Runtime Credentials Asset field. Enter the URL of the tenant that the robot will connect to in order to execute the classification. The URL should be in the following format:
https://<baseURL>/<OrganizationName>/<TenantName>.
使用生成式分类器
To quickly get started with the generative capabilities of the Classify Document activity, perform the following steps:
- Add a Classify Document activity
- From the Project dropdown list, select Predefined or Generative Predefined.
- For Classifier, select Generative Classifier. The Document Type Details property appears in the body of the activity.
- In the Document Type Details collection, provide your instructions as Dictionary key-value pairs, where:
-
Key represents the Document Type (example: CV).
-
Value represents the Generative prompt: The description used by the generative classifier to identify the document types. For example, check the following table for a sample of key-value pairs:
Table 1. Key-value pairs used as a prompt for the generative classifier
密钥 值 计算机视觉 “查找常见的简历关键字,例如“教育背景”、“技能”和“经验”。” 发票 “查找常见字段名称,例如“发票编号”、“收款人”或“总金额”。” Figure 1. Key-value pairs used as a prompt for the generative classifier

-