Document Understanding user guide

Last updated Apr 6, 2026

DELIVERY:

RegEx Based Extractor

What is RegEx Based Extractor

The Regex Based Extractor is the perfect tool for simple use cases, in which, for certain fields, data is always found in a strict, predictable format and context. In other words, if you have a field for which you can define a Regular Expression that is consistently good when matched, then the Regex Based Extractor is a good choice.

The activity comes with a configuration wizard that assists you in defining the regular expressions for the fields you want to target for data extraction in this way.

The activity supports both simple fields as well as table field extraction.

It is recommended to look into other extraction methods, in case there is a high variability of the context and format of the expected values. In such cases, either a Form Extractor or a Machine Learning Extractor may be better suited.

This extractor does not have learning (training) capabilities and requires up-front configuration.

Special requirements

There are no special requirements for using the Regex Based Extractor.

How to configure

Activity configuration

The Regex Based Extractor has two major configurations to be considered:

the Configure Regular Expressions wizard - which allows you to define regular expressions for certain fields. This wizard also makes available the Regex Editor wizard, which assists you in building your regular expressions.
the UseVisualAlignment setting - which allows you to control whether the regular expressions configured for an extractor should be applied to the text output of the digitization component, or to a text version in which text lines are organized visually, and words are rearranged on lines based on their visual alignment.

On this page

What is RegEx Based Extractor
Special requirements
How to configure
Activity configuration

Was this page helpful?

PREVIOUSMachine Learning Extractor

NEXTData Extraction Validation

Document Understanding user guide

What is RegEx Based Extractor​

Special requirements​

How to configure​

Activity configuration​

Was this page helpful?

What is RegEx Based Extractor

Special requirements

How to configure

Activity configuration