UiPath Documentation
document-understanding
2023.10
false
重要 :
新发布内容的本地化可能需要 1-2 周的时间才能完成。
UiPath logo, featuring letters U and I in white

Document Understanding 用户指南

上次更新日期 2026年4月6日

复选框和签名

复选框

有几种类型的多选字段使用复选框:

  • 互斥复选框
  • 非互斥复选框,您可以在其中选择多个选项。

另一个重要方面是给定多选字段的可用选项数量。在某些情况下,可能只有一个选项,即复选框是否处于选中状态,而在其他情况下,可能有 10、20 或更多选项,以网格或表格的形式排列,就像许多体检表单一样。

您可以通过两种主要方式来为这些类型的多项选择字段添加标签。

Label the options

Let's take an example to understand how you can label the options. Forms can include the options Project or Policy. In this case, you only have one field, and you only label the selected word, i.e. label the word Project if the checkbox next to it is checked or the word Policy if the checkbox next to it is checked. If neither is checked then you label neither, and both being checked is not possible, and such documents would just be deleted from the training set.

标记界面的屏幕截图

此方法的优点是您只有一个字段,需要的数据较少。它的另一个优点是不依赖于复选框的成功检测。如果复选框被检测为字母 X,则模型仍可以学习去识别这意味着它旁边的选项已被选中。

The disadvantage is that you need to make sure both options are roughly equally represented, which is not always the case. Potentially, in your training set, 90% of the documents might have Project checked. In this case, the model cannot perform well and this approach fails. The problem gets worse when you have more options because some of them are almost always rare. In these cases you may need to create fake documents with the rare options checked to balance things out.

使用单个多值字段为选项添加标签

这也使添加标签更容易,对复选框检测错误不那么敏感,但可能对不平衡的选项更敏感,就像第一个选项一样。

标记界面的屏幕截图

签名检测

Starting with the 2022.4 LTS Enterprise release, signatures can be detected using the UiPath Document OCR, hence, Machine Learning Models can directly detect signatures.

像在文档中为任何其他字段添加标签一样,为签名添加标签。只要被 UiPath 文档 OCR 检测到,机器学习模型就会学习将该字段识别为签名。

备注:

For a more in depth signature analysis, you can also use the Signature Comparison model that resides in the UiPath Image Analysis ML Packages.

  • 复选框
  • Label the options
  • 使用单个多值字段为选项添加标签
  • 签名检测

此页面有帮助吗?

连接

需要帮助? 支持

想要了解详细内容? UiPath Academy

有问题? UiPath 论坛

保持更新