- Select the Use line item style data extraction check box. The remaining options on the tab are enabled.
-
Use the Keyword type drop-down list (located below the Column and Keyword Type columns) to select the Keyword Type that values in the first column of the table are assigned to as Keyword Values.
To ignore a column or capture it as XML data only, select <None> from the drop-down list.
- If this Keyword Value is required, select the Required check box (located to the right of the Keyword type drop-down list). If a Keyword Value that is marked as required is missing, the data from the entire row is discarded.
-
Optional: To specify a regular expression rule for the extracted text, enter the rule in the Expression field (located next to the Required check box). The OCR engine will compare the extracted text to the defined regular expression rule; if the text is a match, the value is stored as a Keyword Value. If the text does not match, it is discarded.
For example, if you specify the following regular expression rule: [[:upper:][:lower:][:digit:][:space:]]+ Any value containing a character that is not a letter, number, or space is discarded.
Note:To access the Regular Expression Library, click in the field and press F2. See The Regular Expression Library for more information.
Note:All regular expressions must be ECMA compliant.
Tip:Regular expressions can be used to discard column data that consists of entirely unwanted characters (e.g., a 123***456, where you want to capture 123 and 456 as separate Keyword Values and discard the asterisk separator column). However, if the unwanted characters are located in the middle of valid data (e.g., 123***456, where you want to capture 123456 as one Keyword Value), you can configure a Keyword Lookup/Replace dictionary entry to replace the data.
- Click Add.
-
Repeat Steps 2-5 for all columns in the table. Columns must be added to the list as they appear in the document from left-to-right in order for Keyword Values to be extracted correctly.
Select a configured column and use the Move Up and Move Down buttons to re-arrange it as needed. Select a configured column and click Delete to delete the column configuration.
- Click Save.