UiPath Documentation
activities
latest
false
重要 :
请注意,此内容已使用机器翻译进行了部分本地化。 新发布内容的本地化可能需要 1-2 周的时间才能完成。
UiPath logo, featuring letters U and I in white

Document Understanding 活动

上次更新日期 2026年4月22日

分类文档

UiPath.IntelligentOCR.StudioWeb.Activities.ClassifyDocument

描述

您可以通过选择所需的分类器和一个要分类的文档,使用此活动对多个文档进行分类。

备注:

The Classify Document activity uses public endpoints.

The supported languages for the generative models are the same as the used OCR engine used. For more information, check the OCR Supported languages page.

Unless this activity is the first Document Understanding activity part of a Studio workflow, the input should be Document Data. File should only be used as input if the activity is the first Document Understanding one part of a Studio workflow.

已知限制

The Generative Predefined project type and the corresponding extractors are not available in Automation Suite.

项目兼容性

Windows | 跨平台

配置

设计器面板
  • Input - Provide the input file or the Document Data object.
    重要提示:

    The maximum numbers of pages a file can have is 500. Files exceeding this limit fail to be classified.

    提示:

    When your files aren't stored as an IResource type variable, there's an option to perform a conversion. Use LocalResource.FromPath(<reference_to_the_file>) in the Input property field for this. Consider a scenario where you are iterating through a list of files using a For Each activity. Suppose currentItem is your iterating variable. To convert currentItem into IResource, paste LocalResource.FromPath(currentItem) into the Input field.

  • Document Understanding project - Requires you to select your Document Understanding project from the drop-down menu. The available options are:
    • Predefined - Project that uses pre-trained specialized models recommended for standard scenarios.
    • Generative Predefined - Project that uses pre-trained generative models accepting instructions as input for classification or extraction of document data.
    • 您连接的租户和文件夹中的现有项目
    • You can create a new project by selecting the + icon.
      备注:

      If you have created more than 500 projects on your tenant and use the Classify Document activity, UiPath Studio or Studio Web will not display any projects beyond the initial 500. Therefore, those projects cannot be used.

  • Classifier - If you are using the Predefined project, then you can select your desired Document Understanding classifier from the drop-down menu.
    备注:

    The data sent to the Generative Classifier will be sent to an LLM Model instance which is not publicly available, will not leave it, and once processed, it will not be stored or used for training.

    • For the Predefined project you have two options:
      • ML Classification – ML-based classifier.
      • Generative Classifier – The generative classifier type.
        • Document Type details - Instructions to identify Document Types, provided as key-value pairs, where the key represents the name of the Document Type and the value a description for it, helping the classifier identify such documents.
          • Document Type - Provide the name of the document type to be used as classification result (30-character limit).
          • Instruction - Requires you to provide instructions for the Generative Classifier on how to identify the document type. The maximum number of characters allowed is 1000.
    • For the Generative Predefined project you can only use the Generative Classifier.
  • Version - Use this property when using an existing Document Understanding modern project. Select the tag that corresponds to the project version from which you want to process data. For instance, if you choose the Production tag assigned to Version 3, the activity processes data from Version 3 of your project in the production environment. The default value for Version is Staging. If the Staging tag doesn't exist in your selected project, then the default value is Production. After selecting a tag, the activity displays a list of supported document types for that version.
属性面板

高级选项

  • Minimum confidence - Specify the minimum confidence threshold based on which a document type is assigned during classification. If a document's confidence score falls below this threshold, its Document Type is reported as "unknown".
    提示:

    Most document types generate a prediction with a confidence level. Setting this property prevents false positives by only considering the predictions with a confidence level above the threshold. You can identify an optimal confidence level by testing various documents within your workflow, recording the results in an Excel spreadsheet, for example, and then analyze what threshold value is the most accurate.

  • Design-time external connectionThe design-time external connection allows you to leverage the activity using Document Understanding resources from other projects or tenants. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned in the Configuring runtime external connection page. Once these steps are completed, you can then proceed to configure the runtime external connection.
    • App ID: Enter the App ID of the external application you previously created.

    • App secret: Enter the App secret of the external application you previously created.

    • Tenant URL: Enter the URL of the tenant where you created the external application. This is the tenant from where you will use resources at design-time.

      URL 应采用以下格式: https://<baseURL>/<OrganizationName>/<TenantName>

输入

  • Timeout (seconds) - Maximum execution time (in seconds) for the call to the generative model. If the operation exceeds this timeout, it is automatically terminated to prevent delays or hangs. This property is only displayed if the Generative Classifier is selected as a classifier.

输出

  • Document Data - All the validated extracted field data from the file.

运行时外部连接

The runtime external connection allows you to execute the activity via on-premises robots. Before configuring these properties, ensure you have fulfilled the prerequisites mentioned in the Configuring runtime external connection page. Once these steps are completed, you can then proceed to configure the runtime external connection.

  • Runtime Credentials Asset
    • Use this field when you need to access Document Understanding resources while the robot is connected to a local Orchestrator, or from a different tenant. You can choose to enter a Credential Asset, for authentication purposes, in one of the following ways:
    • From the dropdown list, select the desired Credential Asset from the Orchestrator to which the UiPath® Robot is connected to.

    • 如果您在 Orchestrator 凭据资产中存储了用于访问项目的外部应用程序凭据,请手动输入 Orchestrator 凭据资产的路径。

      路径的格式应为: <OrchestratorFolderName>/<AssetName>

  • Runtime Tenant Url - Use this field, alongside the Runtime Credentials Asset field. Enter the URL of the tenant that the robot will connect to in order to execute the classification. The URL should be in the following format: https://<baseURL>/<OrganizationName>/<TenantName>.

使用生成式分类器

To quickly get started with the generative capabilities of the Classify Document activity, perform the following steps:

  1. Add a Classify Document activity
  2. From the Project dropdown list, select Predefined or Generative Predefined.
  3. For Classifier, select Generative Classifier. The Document Type Details property appears in the body of the activity.
  4. In the Document Type Details collection, provide your instructions as Dictionary key-value pairs, where:
    • Key represents the Document Type (example: CV).

    • Value represents the Generative prompt: The description used by the generative classifier to identify the document types. For example, check the following table for a sample of key-value pairs:

      Table 1. Key-value pairs used as a prompt for the generative classifier

      密钥
      计算机视觉“查找常见的简历关键字,例如“教育背景”、“技能”和“经验”。”
      发票“查找常见字段名称,例如“发票编号”、“收款人”或“总金额”。”

      Figure 1. Key-value pairs used as a prompt for the generative classifier 用作生成式分类器提示词的键值对

  • 描述
  • 已知限制
  • 项目兼容性
  • 配置
  • 使用生成式分类器

此页面有帮助吗?

连接

需要帮助? 支持

想要了解详细内容? UiPath Academy

有问题? UiPath 论坛

保持更新