UiPath Documentation
activities
latest
false
重要 :
请注意,此内容已使用机器翻译进行了部分本地化。 新发布内容的本地化可能需要 1-2 周的时间才能完成。
UiPath logo, featuring letters U and I in white

Document Understanding 活动

上次更新日期 2026年4月22日

分类文档作用域

UiPath.IntelligentOCR.Activities.DocumentClassification.ClassifyDocumentScope

描述

为分类器活动提供作用域,进而提供执行文档分类所需的所有必要文件。至少接受一个分类器或接受分类器之间的多个代理,确保将所有参数转发到子分类活动。

项目兼容性

Windows - Legacy | Windows

配置

属性面板

常见

  • “显示名称”- 活动的显示名称。

输入

  • DocumentObjectModel - The Document Object Model (DOM) you want to use to validate the document against. This model is stored in a Document variable and can be retrieved from the Digitize Document activity. Visit Digitize Document to learn how to use the activity. This field supports only Document variables.
  • DocumentPath - The path to the document you want to validate. This field supports only strings and String variables.
    备注:

    该属性字段支持的文件类型包括“.png”、“.gif”、“.jpe”、“.jpg”、“.jpeg”、“.tiff”、“.tif”、“.bmp”和“.pdf”。

  • DocumentText - The text of the document itself, stored in a String variable. You can retrieve this value from the Digitize Document activity. Visit Digitize Document to learn how to use this activity. This field supports only strings and String variables.
  • “分类”- 处理文档要依据的分类,存储在“DocumentTaxonomy”变量中。该字段仅支持“DocumentTaxonomy”变量。

其他

  • “私有”- 选中后将不再以“Verbose”级别记录变量和参数的值。
输出
  • “分类结果”- 对指定文件运行分类器文件后所得的结果,存储在“IReadOnlyList<ClassificationResult>”对象中。该字段仅支持“IReadOnlyList<ClassificationResult>”变量。

The ClassificationResult object contains the following information:

  • “文档类型 ID”- 与“分类”匹配的文档类型所对应的 ID。
  • “文档 ID”- 已处理文档的文件名。
  • “内容类型”- 已处理文档中包含的内容类型。
  • Confidence - Classification confidence, displayed as a numeric value between 0 and 1.
  • OcrConfidence - OCR confidence for the characters that are part of the reported reference, displayed as a numeric value between 0 and 1.
  • “引用”- 分类的证明,可用于文档的文本版(通过“TextStartIndex”和“TextLength”)和“文档对象模型”(通过“Tokens”以及用于从中选取证明的每个页面的高亮显示框)。
  • “文档边界”- 用于描述分类所属的文档部分,包括“StartPage”(“Int32”,从 0 开始)、“PageCount”(“Int32”)、“TextStartIndex”(“Int32”,从 0 开始)、“TextLength”(“Int32”)。
  • “分类器名称”- 由“分类文档作用域”活动自动填充,将填充为用于报告当前“分类结果”的分类器的显示名称。
    备注:

    ClassificationResults”将所有内容按可信度分数降序排列,这表示最顶部的内容具有最高可信度。

使用“配置分类器”向导

The Configure Classifiers Wizard allows you to configure the way the classifiers are applied to each document type, and what results are acceptable.

请按照以下步骤配置向导:

  1. 将“对文档范围进行分类”活动添加到工作流中。

  2. Add one or more classifier activities inside the Classify Document Scope activity.

    1. 为您的分类器提供暗示性名称。

    2. 按接受优先级的顺序,从左到右对范围内的分类器排序。

    3. Configure your classifiers selecting Configure Classifiers.

      You can now see the Wizard

      Figure 1. Overview of the Configure Classifiers wizard

      “配置分类器”向导概览

  3. 选中要激活的分类器和文档类型对的复选框。取消选中某个分类器的文档类型可能出现在以下任一情况:

    • 未训练或配置分类器,无法识别该特定文档类型
    • 分类器对该特定文档类型的执行不如预期,如果分类器返回此类结果,则应将其忽略。
  4. If a classifier has its own taxonomy, then use the text boxes next to each check box to set the correct Taxonomy mapping between the two taxonomies. For example, if Classifier1 has been configured to return class INV for an invoice, but your project taxonomy contains a document type called "Incoming Invoice", then the box corresponding to "Incoming Invoice" and that particular Classifier1 should contain the string INV.

  5. Set a Minimum Confidence threshold, from 0 to 100, for each classifier in the Classify Document Scope. Any classification result with a confidence lower than this threshold will not be stored in the Classify Document Scope activity output.

    提示:

    Most document types generate a prediction with a confidence level. Setting this property prevents false positives by only considering the predictions with a confidence level above the threshold. You can identify an optimal confidence level by testing various documents within your workflow, recording the results in an Excel spreadsheet, for example, and then analyze what threshold value is the most accurate. Apply the threshold by adjusting the Minimum Confidence property in your current scope.

  6. Select Save once all the classifiers are configured.

    Figure 2. The Configure Classifiers wizard configured to use a different classifier for each document type

    配置为对每种文档类型使用不同分类器的“配置分类器”向导

Document Understanding 集成

The Classify Document Scope activity is part of the Document Understanding solutions. Visit the Document Understanding Guide for more information.

此页面有帮助吗?

连接

需要帮助? 支持

想要了解详细内容? UiPath Academy

有问题? UiPath 论坛

保持更新