A Grouped Line Item Extraction Data Field Zone allows you to extract Keyword Values from multiple groups, or tables, within a single zone.
Keyword Values in each group can be identified by a tag (either a literal tag or a regular expression), regular expression, or by columns in line item data.
Grouped Line Item Extraction Data Field Zones are intended to be used with Multi-Instance Keyword Type Groups in order to capture data from multiple tables while maintaining the Keyword Values' relationship to one another.
For example:
Here you have two groups of course data organized by term. For each course, you want to identify five pieces of information: the term the course was taken, the Course ID, the Course Description, Grade, and Credits). This course information is to be stored in a Multi-Instance Keyword Type Group; each instance represents information about one course the student has completed.
In this example, the Term Keyword Value is identified by a regular expression, and the remaining course information is identified by the column that it is displayed in.
A Grouped Line Item Data Field Zone is configured using the options on the Grouped Line Item Extraction tab of the Data Field Zone dialog box.
Unlike Line Item Extraction Zones, Grouped Line Item Extraction Zones use only manual table decomposition.
To configure a Grouped Line Item Extraction Zone, select the Use grouped line item style data extraction check box. The remaining options on the tab are enabled and a preview of the Data Field Zone is displayed to the right of the Data Field Zone dialog box.
There are several steps required to configure a Grouped Line Item Extraction Zone:
- Specify How Line Item Data is to be Extracted as Keyword Values. Each column in the group is mapped to a Keyword Type.
- Specify How Data is to be Extracted as Keyword Values Using Tags or Regular Expressions. Data identified by a tag (tags can be literal text or regular expressions) or a regular expression can be assigned as Keyword Values to specified Keyword Types.
- Configure the Additional Zone Configuration Options. Once you have specified how the OCR engine should address the data in the group, you can specify other configuration options for the zone (e.g., page locations, VB scripts, Suspect Level, etc.).