An OMR Data Field Zone is configured using the options on the Optical Mark Field tab of the Data Field Zone dialog box.
Data Field Zone Optical Mark Field Options |
Description |
---|---|
Keyword type |
Using this drop-down list, select the Keyword Type that this Data Field Zone will assign a Keyword Value to. If the currently-displayed Advanced Capture form has an assigned Document Type, only the Keyword Types associated with the assigned Document Type are available in the drop-down list. If no Document Type is assigned to the currently-displayed Advanced Capture form, only the <None> option is available in the drop-down list. Note:
If the Keyword Value identified by the OCR engine exceeds the maximum length of the Keyword Type it is assigned to, then the Keyword Value is truncated to fit this length. |
XML Node |
Note:
This field is only displayed if the Advanced Capture form is configured to create an XML rendition of documents matched to it (i.e., the Create XML data rendition option is selected for the Advanced Capture form). Note:
This field is disabled if the OMR item group mode check box is selected. Enter the name of the XML node (i.e., the element) that Keyword Values extracted from this zone are contained in when the XML rendition of the document is created. Note:
While the XML Node name may include alphanumeric characters, underscores (_), hyphens (-), or periods (.), it must begin with either a letter or an underscore. |
OMR item group mode |
Note:
To use this feature, multiple optical marks must be reside in the Data Field Zone. Select this check box to allow the OCR engine to process multiple optical marks inside the Data Field Zone. Each optical mark must share all of the same configuration options (e.g., Keyword Type, Framed/Unframed, Sensitivity level, etc.) except for their associated positive and negative Keyword Values. When this check box is selected, the Keyword Values section of the Data Field Zone dialog box is displayed as a list containing Item #, Positive Value, and Negative Value, and an image snippet of the area of the document selected for the Data Field Zone is displayed next to the Data Field Zone dialog box. From the image snippet, use the pointer to draw a box around the first optical mark in the Data Field Zone. The Modify Keyword Result for OMR Group Item dialog box is displayed, allowing you to specify the positive and negative Keyword Value for that optical mark. Click Save when finished. To re-open the dialog box and edit these values once they have been configured, double-click on the corresponding entry in the table, make the desired changes, and click Save. If the Advanced Capture form is configured to create XML renditions of the documents it is matched to, the XML Node field is displayed, giving you the opportunity to name the XML node that the Keyword Values are contained in when the XML rendition is created. When the OMR item group mode option is selected, the Minimum Positive and Maximum Positive options are displayed between the Sensitivity and Advanced filtering options. In the Minimum Positive and Maximum Positive fields, you can specify the minimum and maximum number of optical marks that the OCR engine should detect within the Data Field Zone, respectively. If the number of optical marks detected is less than the specified minimum or greater than the specified maximum value, the values for the Data Field Zone will be automatically marked as suspect. |
OMR item group mode (cont.) |
For example, if a form has two check boxes for gender, and you only expect one box to be checked (i.e., either male or female), you can specify both the Minimum Positive and Maximum Positive values to be 1. If neither box is checked, or if both boxes are checked, the values for the zone will be marked as suspect. By default, the Minimum Positive and Maximum Positive values are both set to 0, which keeps minimum/maximum optical mark validation disabled. When finished, click Save. The optical mark configuration zone is highlighted on the image snippet. Right-click on the optical mark configuration within the image snippet to move, resize, or delete the configuration information for that optical mark configuration zone. Repeat this process for each optical mark displayed in the Data Field Zone. |
Keyword Values |
In the Positive Result Value field, enter the Keyword Value that is assigned if the OCR engine determines that an optical mark is present. In the Negative Result Value field, enter the Keyword Value that is assigned if the OCR engine determines that an optical mark is not present. For example: you are processing an application and one of the questions asks the applicant to select a check box if he/she has previously applied to the university. Depending on the answer to this question, a Keyword Value is assigned to the Previously Applied Keyword Type.
|
Page Location(s) |
The Page Location(s) options control the pages that the OCR engine searches for a particular Data Field Zone.
|
Mark Frame Detection |
Select the radio button that describes the optical mark being read by the OCR engine.
Tip:
Selecting the Framed or Unframed selection, where appropriate, will help to increase the accuracy of the Advanced Capture process. |
Sensitivity |
Select a radio button to determine the Suspect Level for this Data Field Zone.
The Suspect Level is the level of confidence placed in the processing results for this field. After a zone is processed, the OCR engine gives the resulting value a score between 1 and 99, depending on how confident it is in the result that was returned. The higher the score is, the lower the OCR engine's confidence is in the results. The Sensitivity level selected for this field is the threshold at which the OCR engine determines if a returned value is acceptable or suspect. A score returned by the OCR engine higher than the Suspect Level threshold you set causes the value captured from the zone to be marked as suspect. All scores lower than the Suspect Level threshold indicate that the captured value is considered by the OCR engine to be acceptable. For example, setting the Sensitivity to Lowest would indicate you have a fair amount of confidence in the result returned by the OCR engine because few higher scores could be returned and fewer results would be determined to be suspect. Setting the Suspect Level to Highest would indicate you have less confidence in the result because a great number of lower scores could be returned and more results would be determined as suspect. |
Support ‘filled-in error' detection |
Note:
This option is enabled only when the Framed or Auto-detect radio button is selected in the Options section. Select this check box to enable optical mark “filled in error” detection. When this option is selected, the OCR engine will attempt to determine if a user has “crossed out” an optical mark on the document in order to indicate it is not present. For example, users occasionally select a check box and later realize that they made an error (i.e., they meant to leave the check box unselected), so they attempt to cross it out. This option allows the OCR engine to attempt to distinguish between a selected check box and a crossed-out check box. If the OCR engine determines that a field has been marked in error (i.e., crossed out), the field is marked as suspect. If this option is not selected, the OCR engine will not attempt to discern the difference between a selected check box and a crossed-out check box. |
Advanced filtering |
Use the VB script drop-down to select a VBscript to associate with the processing of this Data Field Zone. Click the ... button to open the VB Scripts dialog box. Here, the selected script can be re-configured or edited. |
Test |
Click the Test button to have the OCR engine perform a test process on the Data Field Zone and attempt to detect an optical mark using the options you have specified. If the Advanced Capture engine is configured to attempt to compensate for offset data, the resulting offset/scaling adjustments will take place during the test. Once the test process is complete, a dialog box is displayed indicating if an optical mark was detected and the Keyword Value that would be assigned based on this result. The offset/scaling information is also displayed in the dialog box. |
Keyword association |
You can logically group Data Field Zones using Keyword association. These groupings can be used to identify Keywords that belong to a Multi-Instance Keyword Type Group. Use the Keyword association field to enter or select a logical group name for the Data Field Zones that identify Keywords belonging to an MIKG. The Advanced Capture engine will attempt to maintain this Keyword grouping on the resulting document. |
Colors |
If you would like to assign specific colors to the Keyword Type configured for the Data Field Zone, click the Colors button. The Display Colors dialog box is displayed. Here you can change the colors in which any regular or suspect values for the corresponding Keyword Type are displayed in the Indexing panel once Advanced Capture processing has taken place.
Note:
Any colors assigned here can be overridden by colors assigned through Keyword Lookup/Replace settings and/or VB scripting. |
Activation groups |
When you have configured multiple Form Identification Zones or Page Registration Zones for a document, you can assign individual Data Field Zones to a specific Form Identification or Page Registration Zone using activation groups. Activation groups allow you to activate only the Data Field Zones assigned to the Form Identification or Page Registration Zone that is used to match the document to an Advanced Capture form. Data Field Zones assigned to Form Identification or Page Registration Zones that are not used to match the document to a form will not be processed. Also, Data Field Zones present on pages other than the pages containing their assigned Form Identification or Page Registration Zones will not be processed, unless otherwise specified through the Page Location(s) setting or by adding a + to the front of the activation group name on the Form Identification or Page Registration Zone. This selective activation saves processing time and reduces the number of forms that need to be created for a Document Type. Use the Activation groups field to enter or select an activation group name. Add a + to the front of a group name (e.g., +Group1) on a Form Identification or Page Registration Zone to set all Data Field Zones assigned to this group to be processed. Use commas to separate multiple group names.
|
Activation groups (cont.) |
Alternatively, you can assign a form definition group as the Data Field Zone's activation group to activate the zone for processing. Form definition groups can be used to extract only specific types of information (e.g., header data vs. detail data) during processing. In the Activation groups drop-down list, form definition groups are enclosed in brackets (e.g., [Group1]). |