- Overview
- Getting started
- Building models
- Consuming models
- Model Details
- Public endpoints
- 1040 - document type
- 1040 Schedule C - document type
- 1040 Schedule D - document type
- 1040 Schedule E - document type
- 1040x - document type
- 3949a - document type
- 4506T - document type
- 709 - document type
- 941x - document type
- 9465 - document type
- ACORD125 - document type
- ACORD126 - document type
- ACORD131 - document type
- ACORD140 - document type
- ACORD25 - document type
- Bank Statements - document type
- Bills Of Lading - document type
- Certificate of Incorporation - document type
- Certificate of Origin - document type
- Checks - document type
- Children Product Certificate - document type
- CMS 1500 - document type
- EU Declaration of Conformity - document type
- Financial Statements - document type
- FM1003 - document type
- I9 - document type
- ID Cards - document type
- Invoices - document type
- Invoices2 - document type
- Invoices Australia - document type
- Invoices China - document type
- Invoices Hebrew - document type
- Invoices India - document type
- Invoices Japan - document type
- Invoices Shipping - document type
- Packing Lists - document type
- Payslips - document type
- Passports - document type
- Purchase Orders - document type
- Receipts - document type
- Receipts2 - document type
- Receipts Japan - document type
- Remittance Advices - document type
- UB04 - document type
- US Mortgage Closing Disclosures - document type
- Utility Bills - document type
- Vehicle Titles - document type
- W2 - document type
- W9 - document type
- Supported languages
- Insights dashboards
- Data and security
- Logging
- Licensing
- How to
- Troubleshooting

Document Understanding Modern Projects User Guide
When creating a new project, tenants located in Europe, the US and Japan may enable our new splitter and classifier model. This trainable model can automatically split and classify complex documents, enabling you to turn messy packets into clean, typed documents.
Follow the instructions from this page to a Document UnderstandingTM project and enable the new splitter and classifier model:
- Open Document Understanding.
- Select Create project.
- Fill in the desired project name.
- Select Modern to use the modern
experience.
Note: This is a guided model building experience that also includes recommendations for optimal model performance and active learning.
- Switch on the Enable new splitter and classifier model toggle.
- Configure Advanced options if needed.
- Switch on the Enable splitting toggle to enable the model to
automatically split documents into individual files before
classification. You can also enable this feature from the Project
settings screen.
Important: When the Enable splitting option is turned off, all documents are classified as a whole.
- Select the OCR you want to use for your new project from the OCR method drop-down list.
- Fill in the OCR API Key.
Note: This field is filled in automatically if you select a UiPath® OCR.
- Fill in the OCR URL. For the full list of URLs for UiPath OCRs, check the Public Endpoints page.
- Choose if you want to Apply OCR on PDFs. This is set to Auto by default.
- Switch on the Enable splitting toggle to enable the model to
automatically split documents into individual files before
classification. You can also enable this feature from the Project
settings screen.
- Select Create.
Figure 1. Figure 1. Creating your first project
After successfully creating your project, you can upload your documents from the Build section.
Choose one of the two available options:
- Extract data from
documents: pulls specific fields, such as invoice numbers, dates, total
from your documents etc.
Note: We recommend choosing this option when you need structured data for automation or reporting.
- Classify and split
documents: sorts documents by type and separates multiple documents
within a single file.
Note: We recommend choosing this option when you need to organize and prepare documents for extraction.
- Select a document type.
- Select Upload or drag and drop your files inside the new document type.
Wait for the upload to finish.
Certain complex files contain several document types. Our new model can detect where each sub-document starts and ends and classify each section accordingly.
-
Click on Classify and Split Documents and upload your document packets. Wait for the document to finish uploading and processing.
-
Select any documents from the upload section and click Split. This will open up the splitting annotation interface.
Note: If the project already has a trained model, uploaded documents will be pre-annotated using that model. This helps speed up annotation and allows you to view prediction results on new documents. -
Click New document type to create a document type for each item in your desired taxonomy. You can select a predefined document type or create a custom one.
Note: For custom document types, provide a name, a short description explaining its purpose, and comma-separated key indicators (such as unique fields or terms) that help identify it. -
Indicate where documents should be separated. Assign each page range to a document type using the dropdown menu. Once you have finished annotating the document, click Confirm.
Note: Clicking Confirm triggers document processing. After processing, each sub-document will appear under its corresponding document type in the Build section.Note: Each sub-document moved to a document type will get pre-annotated with the schema of the document type.
Model training
Model training is triggered only after:
- At least five sub-documents have been created and annotated;
- A document has been confirmed.
The training status can be viewed in the upper right-hand corner of the Classification pane.
- The maximum document size is 160MB or 500 pages.
- Pages cannot be reordered or deleted.
Splitting and classification predictions
Whenever a new model is trained, all documents within the project receive predictions from the trained model. This allows you to review the performance of the classification model.
The “Type” column displays the ground truth, which is the document type as it was annotated. The “Predicted type” column shows the type predicted by the model.
By default, only the document packets are displayed in the UI. To view the sub-documents within each packet, click View and select the Include sub-documents checkbox.
Predictions can also be viewed in the annotation interface by enabling the “Show Prediction” toggle.