document-understanding
latest
false
UiPath logo, featuring letters U and I in white
Document Understanding User Guide
Automation CloudAutomation Cloud Public SectorAutomation SuiteStandalone
Last updated Nov 7, 2024

UiPath® DocPath

The DocPath large language model (LLM) is our latest data extraction model technology, designed to replace current generation models used within UiPath® Document UnderstandingTM. While DocPath operates similarly to previous models, it was trained using a wide variety of documents. This enables it to process common document types with little to no training needed. What sets DocPath LLM apart is its generative architecture, which significantly improves accuracy and simplifies extraction. Additionally, you can also fine-tune the model with your unique datasets.

To gain further insights into the DocPath architecture and the techniques used for training, check the DocPath page from our AI blog.

Availability

Currently, UiPath DocPath is only available for US-based tenants. Support for other regions is planned to roll out in early 2025.

Improvements over previous generation

DocPath LLM offers numerous enhancements over previous models. It improves accuracy, especially with tables, adapts to various document layouts to reduce annotation efforts, and boosts automation rates.

Key improvements include:
  • Improved accuracy: DocPath LLM delivers a higher accuracy rate and superior F1 score for semi-structured documents such as invoices, receipts, and purchase orders. This ensures precise and consistent data extraction.
  • Effortless annotation: The model reduces manual work by only requiring one annotation per document, eliminating the need to annotate each field instance on every page.
  • Enhanced automation: With a greater correlation between confidence level and accuracy, DocPath LLM enhances automation rates while reducing the number of documents sent to Action Center for the same accuracy level.

From our internal tests, DocPath outperformed its predecessor in performance. It reduced the false positive rate by around 15%, and the false negative rate dropped by nearly 17%.

How to use DocPath

The DocPath LLM is available exclusively for Document Understanding modern projects. Despite the introduction of DocPath, all existing project versions will still use current model versions. This ensures a seamless transition without any disruption to ongoing production workflows.

To start training an exisiting document type on DocPath, unconfirm and confirm all fields in a few documents.

  1. Choose the document type you want to train on DocPath.
  2. Select a document.
  3. Select all fields from the document and choose Delete.


  4. Annotate all the fields from the document and select Confirm.
    Note: Repeat steps 3 and 4 until training is initiated on the chosen document type.


How to check if DocPath is enabled

After training your models on DocPath, check the model version to make sure that DocPath is enabled.
  1. Go to the Publish page and create a new project version.
  2. Select the three-dot icon next to the project version and choose Edit version to check the model version.
    Note: All models version 24.7 and above are UiPath DocPath models.


Optimizing results

The field names you choose can greatly impact the performance of the model. To ensure optimal results, use natural language and proper grammar for field names. You should only use widely recognized acronyms such as Number (No), Account (Acct), Address (Addr), and Apartment (Apt). Currently, only West European languages are supported, so make sure that the chosen field names align with these languages. Refrain from using non-descriptive names, such as "Column 3", unless the document specifically uses that terminology.

UiPath® DocPath known limitations

The following limitations currently apply for UiPath DocPath:
  • The extracted fields must match exactly with the text in the documents. This process does not include summarization or other types of text analysis.
  • Custom training is not applicable for the following document types. If you attempt to use DocPath for these, it will result in an error:
    • Invoices China
    • Invoices Hebrew
    • Invoices Japan

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2024 UiPath. All rights reserved.