activities
latest
false
UiPath logo, featuring letters U and I in white

Document Understanding Activities

Last updated Jun 11, 2025

PREVIEW
Redact Document

UiPath.IntelligentOCR.Activities.Redaction.RedactDocument

Description

The Redact Document activity generates a redacted PDF based on the original input PDF (provided as the Document Path), the Extraction Result and Words To Redact input fields.

The Redact Document activity uses the Document Object Model to access the location of all words identified in the PDF, while the Extraction Result and Words To Redact fields are used as input for the data that should be redacted, as follows:

  • Every entry in the Words To Redact input array is considered as a string for a case-insensitive, continuous search within the document for redaction purposes.
  • Extraction Result values that have a reference will be redacted based on this reference value (including customer area selections as reference for a value). Both regular fields and table cells are redacted.
  • Extraction Result values that lack a reference (added without a reference to a field where Requires Reference is set to False) are considered similarly to entries in the Words To Redact field. This means that all occurrences of that specific text found in the input document will be redacted.

The activity searches for words using the Document Object Model, and fuzzy matching is not available.

For handling critically sensitive documents, we highly recommend performing human validation of extraction results and using reference-based values and selections. This ensures a comprehensive review of all data that needs to be redacted, minimizing the chances of OCR errors or word order issues affecting the final redaction output.

Project compatibility

Windows-Legacy | Windows

Configuration

Designer panel

Input

  • Document Path: The path to the document you want to redact.
  • Document Object Model: The Document Object Model for the input documented obtained from the Digitize Document activity.
  • Extraction Results (Optional): The extraction results of the data extraction process, stored in an ExtractionResult variable. You can retrieve this from the Data Extraction Scope activity.
  • Words To Redact (Optional): A list of strings that will be redacted, in addition to the data sourced from the Extraction Results input field.
  • Output File: The output file path where you want to save the redacted PDF.
Properties panel
Common
  • DisplayName: The display name of the activity.

Input

  • Document Path: The path to the document you want to redact.
  • Document Object Model: The Document Object Model for the input documented obtained from the Digitize Document activity.
  • Extraction Results (Optional): The extraction results of the data extraction process, stored in an ExtractionResult variable. You can retrieve this from the Data Extraction Scope activity.
  • Words To Redact (Optional): A list of strings that will be redacted, in addition to the data sourced from the Extraction Results input field.
  • Output File: The output file path where you want to save the redacted PDF.
Misc
  • Private: If selected, the values of variables and arguments are no longer logged at Verbose level.

Output

  • Output File: The output file which contains the redacted information.

Redaction Settings

  • BorderColor: The color of the border used for redaction.
  • BorderThickness: The thickness of the border used for redaction.
  • Dpi: The Dots per inch value that represents the quality of the images embedded in the redacted PDF.
  • FillColor: The fill color used for redaction.
  • Description
  • Project compatibility
  • Configuration

Was this page helpful?

Get The Help You Need
Learning RPA - Automation Courses
UiPath Community Forum
Uipath Logo White
Trust and Security
© 2005-2025 UiPath. All rights reserved.