Document Understanding Activities

Last updated Jul 4, 2025

PREVIEW
Redact Document

UiPath.IntelligentOCR.Activities.Redaction.RedactDocument

Description

The Redact Document activity generates a redacted PDF based on the original input PDF (provided as the Document Path), the Extraction Result and Words To Redact input fields.

The Redact Document activity uses the Document Object Model to access the location of all words identified in the PDF, while the Extraction Result and Words To Redact fields are used as input for the data that should be redacted, as follows:

Every entry in the Words To Redact input array is considered as a string for a case-insensitive, continuous search within the document for redaction purposes.
Extraction Result values that have a reference will be redacted based on this reference value (including customer area selections as reference for a value). Both regular fields and table cells are redacted.
Extraction Result values that lack a reference (added without a reference to a field where Requires Reference is set to False) are considered similarly to entries in the Words To Redact field. This means that all occurrences of that specific text found in the input document will be redacted.

The activity searches for words using the Document Object Model, and fuzzy matching is not available.

For handling critically sensitive documents, we highly recommend performing human validation of extraction results and using reference-based values and selections. This ensures a comprehensive review of all data that needs to be redacted, minimizing the chances of OCR errors or word order issues affecting the final redaction output.

Project compatibility

Windows-Legacy | Windows

Configuration

Designer panel

Input

Document Path: The path to the document you want to redact.
Document Object Model: The Document Object Model for the input documented obtained from the Digitize Document activity.
Extraction Results (Optional): The extraction results of the data extraction process, stored in an ExtractionResult variable. You can retrieve this from the Data Extraction Scope activity.
Words To Redact (Optional): A list of strings that will be redacted, in addition to the data sourced from the Extraction Results input field.
Output File: The output file path where you want to save the redacted PDF.

Properties panel

Common

DisplayName: The display name of the activity.

Input

Document Path: The path to the document you want to redact.
Document Object Model: The Document Object Model for the input documented obtained from the Digitize Document activity.
Extraction Results (Optional): The extraction results of the data extraction process, stored in an ExtractionResult variable. You can retrieve this from the Data Extraction Scope activity.
Words To Redact (Optional): A list of strings that will be redacted, in addition to the data sourced from the Extraction Results input field.
Output File: The output file path where you want to save the redacted PDF.

Misc

Private: If selected, the values of variables and arguments are no longer logged at Verbose level.

Output

Output File: The output file which contains the redacted information.

Redaction Settings

BorderColor: The color of the border used for redaction.
BorderThickness: The thickness of the border used for redaction.
Dpi: The Dots per inch value that represents the quality of the images embedded in the redacted PDF.
FillColor: The fill color used for redaction.

On this page

Description
Project compatibility
Configuration

Was this page helpful?

PREVIOUSCompact Validation Station

NEXTCreate Document Validation Action

Support and Services

Get The Help You Need

UiPath Academy

Learning RPA - Automation Courses

UiPath Forum

UiPath Community Forum

Trust and Security

Cookies Policy