Configuring an Optical Mark Data Field Zone - Advanced Capture - Foundation 23.1 - Foundation 23.1 - Ready - OnBase - Premier - external - Standard - Essential - Premier - Standard - Essential

Advanced Capture

Platform
OnBase
Product
Advanced Capture
Release
Foundation 23.1
License
Premier
Standard
Essential

An OMR Data Field Zone is configured using the options on the Optical Mark Field tab of the Data Field Zone dialog box.

Data Field Zone Optical Mark Field Options

Description

Keyword type

Using this drop-down list, select the Keyword Type that this Data Field Zone will assign a Keyword Value to.

If the currently-displayed Advanced Capture form has an assigned Document Type, only the Keyword Types associated with the assigned Document Type are available in the drop-down list. If no Document Type is assigned to the currently-displayed Advanced Capture form, only the <None> option is available in the drop-down list.

Note:

If the Keyword Value identified by the OCR engine exceeds the maximum length of the Keyword Type it is assigned to, then the Keyword Value is truncated to fit this length.

XML Node

Note:

This field is only displayed if the Advanced Capture form is configured to create an XML rendition of documents matched to it (i.e., the Create XML data rendition option is selected for the Advanced Capture form).

Note:

This field is disabled if the OMR item group mode check box is selected.

Enter the name of the XML node (i.e., the element) that Keyword Values extracted from this zone are contained in when the XML rendition of the document is created.

Note:

While the XML Node name may include alphanumeric characters, underscores (_), hyphens (-), or periods (.), it must begin with either a letter or an underscore.

OMR item group mode

Note:

To use this feature, multiple optical marks must be reside in the Data Field Zone.

Select this check box to allow the OCR engine to process multiple optical marks inside the Data Field Zone. Each optical mark must share all of the same configuration options (e.g., Keyword Type, Framed/Unframed, Sensitivity level, etc.) except for their associated positive and negative Keyword Values.

When this check box is selected, the Keyword Values section of the Data Field Zone dialog box is displayed as a list containing Item #, Positive Value, and Negative Value, and an image snippet of the area of the document selected for the Data Field Zone is displayed next to the Data Field Zone dialog box.

From the image snippet, use the pointer to draw a box around the first optical mark in the Data Field Zone. The Modify Keyword Result for OMR Group Item dialog box is displayed, allowing you to specify the positive and negative Keyword Value for that optical mark. Click Save when finished. To re-open the dialog box and edit these values once they have been configured, double-click on the corresponding entry in the table, make the desired changes, and click Save.

If the Advanced Capture form is configured to create XML renditions of the documents it is matched to, the XML Node field is displayed, giving you the opportunity to name the XML node that the Keyword Values are contained in when the XML rendition is created.

When the OMR item group mode option is selected, the Minimum Positive and Maximum Positive options are displayed between the Sensitivity and Advanced filtering options.

In the Minimum Positive and Maximum Positive fields, you can specify the minimum and maximum number of optical marks that the OCR engine should detect within the Data Field Zone, respectively. If the number of optical marks detected is less than the specified minimum or greater than the specified maximum value, the values for the Data Field Zone will be automatically marked as suspect.

OMR item group mode (cont.)

For example, if a form has two check boxes for gender, and you only expect one box to be checked (i.e., either male or female), you can specify both the Minimum Positive and Maximum Positive values to be 1. If neither box is checked, or if both boxes are checked, the values for the zone will be marked as suspect.

By default, the Minimum Positive and Maximum Positive values are both set to 0, which keeps minimum/maximum optical mark validation disabled.

When finished, click Save. The optical mark configuration zone is highlighted on the image snippet. Right-click on the optical mark configuration within the image snippet to move, resize, or delete the configuration information for that optical mark configuration zone.

Repeat this process for each optical mark displayed in the Data Field Zone.

Keyword Values

In the Positive Result Value field, enter the Keyword Value that is assigned if the OCR engine determines that an optical mark is present.

In the Negative Result Value field, enter the Keyword Value that is assigned if the OCR engine determines that an optical mark is not present.

For example: you are processing an application and one of the questions asks the applicant to select a check box if he/she has previously applied to the university. Depending on the answer to this question, a Keyword Value is assigned to the Previously Applied Keyword Type.

  • If the check box is selected, Yes is assigned as the Previously Applied Keyword Value.

  • If the check box is not selected, No is assigned as the Previously Applied Keyword Value.

Page Location(s)

The Page Location(s) options control the pages that the OCR engine searches for a particular Data Field Zone.

  • Select the Absolute page radio button if the data being read in the Data Field Zone is only displayed on one page and is always displayed on the same page (e.g., the optical mark is always displayed on page 1). Enter the page number that the optical mark is located on in the associated field.

  • Select the Relative page radio button if the data may be located on one or more pages relative to the length of the document. Select one or more of the following check boxes to indicate which page(s) the optical mark may be located on.

    Select the First page check box if the optical mark is located only on the first page of the document. This option can be used in conjunction with the Interior pages and/or Last page check boxes.

    Select the First page (not only page) check box if the optical mark is located on the first page and other pages in the document.

    Select the Interior pages check box if the optical mark is located on every page of the document other than the first or last page. This option can be used in conjunction with the First page and/or Last page check boxes.

    Select the Last page check box if the optical mark is located only on the last page of the document. This option can be used in conjunction with the First page and/or the Interior pages check boxes.

    Select the Last page (not only page) check box if the optical mark is located on the last page and other pages in the document.

Mark Frame Detection

Select the radio button that describes the optical mark being read by the OCR engine.

  • Framed. Select the Framed radio button if the optical mark is located in a pre-defined space, such as a check box or bubble.

  • Unframed. Select the Unframed radio button if the optical mark is located in an undefined or blank space.

  • Auto-detect. Select the Auto-detect radio button if the OCR engine should determine if the optical mark is framed or unframed.

Tip:

Selecting the Framed or Unframed selection, where appropriate, will help to increase the accuracy of the Advanced Capture process.

Sensitivity

Select a radio button to determine the Suspect Level for this Data Field Zone.

  • Highest. Sets the Suspect Level for this Data Field Zone to 70.

  • Low. Sets the Suspect Level for this Data Field Zone to 75.

  • Lower. Sets the Suspect Level for this Data Field Zone to 80.

  • Lowest. Sets the Suspect Level for this Data Field Zone to 85.

The Suspect Level is the level of confidence placed in the processing results for this field.

After a zone is processed, the OCR engine gives the resulting value a score between 1 and 99, depending on how confident it is in the result that was returned. The higher the score is, the lower the OCR engine's confidence is in the results.

The Sensitivity level selected for this field is the threshold at which the OCR engine determines if a returned value is acceptable or suspect. A score returned by the OCR engine higher than the Suspect Level threshold you set causes the value captured from the zone to be marked as suspect. All scores lower than the Suspect Level threshold indicate that the captured value is considered by the OCR engine to be acceptable.

For example, setting the Sensitivity to Lowest would indicate you have a fair amount of confidence in the result returned by the OCR engine because few higher scores could be returned and fewer results would be determined to be suspect.

Setting the Suspect Level to Highest would indicate you have less confidence in the result because a great number of lower scores could be returned and more results would be determined as suspect.

Support ‘filled-in error' detection

Note:

This option is enabled only when the Framed or Auto-detect radio button is selected in the Options section.

Select this check box to enable optical mark “filled in error” detection.

When this option is selected, the OCR engine will attempt to determine if a user has “crossed out” an optical mark on the document in order to indicate it is not present.

For example, users occasionally select a check box and later realize that they made an error (i.e., they meant to leave the check box unselected), so they attempt to cross it out. This option allows the OCR engine to attempt to distinguish between a selected check box and a crossed-out check box.

If the OCR engine determines that a field has been marked in error (i.e., crossed out), the field is marked as suspect.

If this option is not selected, the OCR engine will not attempt to discern the difference between a selected check box and a crossed-out check box.

Advanced filtering

Use the VB script drop-down to select a VBscript to associate with the processing of this Data Field Zone.

Click the ... button to open the VB Scripts dialog box. Here, the selected script can be re-configured or edited.

Test

Click the Test button to have the OCR engine perform a test process on the Data Field Zone and attempt to detect an optical mark using the options you have specified. If the Advanced Capture engine is configured to attempt to compensate for offset data, the resulting offset/scaling adjustments will take place during the test.

Once the test process is complete, a dialog box is displayed indicating if an optical mark was detected and the Keyword Value that would be assigned based on this result. The offset/scaling information is also displayed in the dialog box.

Keyword association

You can logically group Data Field Zones using Keyword association. These groupings can be used to identify Keywords that belong to a Multi-Instance Keyword Type Group.

Use the Keyword association field to enter or select a logical group name for the Data Field Zones that identify Keywords belonging to an MIKG. The Advanced Capture engine will attempt to maintain this Keyword grouping on the resulting document.

Colors

If you would like to assign specific colors to the Keyword Type configured for the Data Field Zone, click the Colors button. The Display Colors dialog box is displayed.

Here you can change the colors in which any regular or suspect values for the corresponding Keyword Type are displayed in the Indexing panel once Advanced Capture processing has taken place.

  • To change the display color for regular values, click Display Color to open your machine's color palette, select a color, and click OK.

  • To change the display color for suspect values, click Suspect Color to open your machine's color palette, select a color, and click OK.

  • To revert back to the default display color for regular or suspect values, click the Automatic button that corresponds to the desired type of values (i.e., the left button left for regular values, the right button for suspect values).

Note:

Any colors assigned here can be overridden by colors assigned through Keyword Lookup/Replace settings and/or VB scripting.

Activation groups

When you have configured multiple Form Identification Zones or Page Registration Zones for a document, you can assign individual Data Field Zones to a specific Form Identification or Page Registration Zone using activation groups. Activation groups allow you to activate only the Data Field Zones assigned to the Form Identification or Page Registration Zone that is used to match the document to an Advanced Capture form. Data Field Zones assigned to Form Identification or Page Registration Zones that are not used to match the document to a form will not be processed. Also, Data Field Zones present on pages other than the pages containing their assigned Form Identification or Page Registration Zones will not be processed, unless otherwise specified through the Page Location(s) setting or by adding a + to the front of the activation group name on the Form Identification or Page Registration Zone. This selective activation saves processing time and reduces the number of forms that need to be created for a Document Type.

Use the Activation groups field to enter or select an activation group name. Add a + to the front of a group name (e.g., +Group1) on a Form Identification or Page Registration Zone to set all Data Field Zones assigned to this group to be processed. Use commas to separate multiple group names.

  • When a Form Identification Zone or Page Registration Zone is matched to a form, all activation groups that have been configured for the zone will be activated.

  • Form Identification Zones are organized into Identification Groups (under Combined rule expressions), and only one Identification Group can be matched to a form. Once an Identification Group has been matched, any remaining Identification Groups on the document will be skipped.

  • Multiple Page Registration Zones can be matched to a form. Every Page Registration Zone on the document will be tested for a match.

  • If multiple activation groups have been configured for a Data Field Zone, the zone will be processed if any of these activation groups is activated.

  • If no activation groups have been configured for a Data Field Zone, the zone will be considered active and thus will be processed.

  • If a Data Field Zone is configured to only be searched for on certain pages (i.e., through the Page Location(s) setting), the zone can only be considered active on these pages. This overrides any conflicting settings that would otherwise activate the Data Field Zone (e.g., when a Data Field Zone is assigned to an activation group that is named with a + on the corresponding Form Identification or Page Registration Zone, or when a Data Field Zone is not assigned to any activation group).

Activation groups (cont.)

Alternatively, you can assign a form definition group as the Data Field Zone's activation group to activate the zone for processing. Form definition groups can be used to extract only specific types of information (e.g., header data vs. detail data) during processing. In the Activation groups drop-down list, form definition groups are enclosed in brackets (e.g., [Group1]).