Recognizer Options - Full-Text Search - English - Foundation 22.1 - OnBase - external

The options in the Recognizer Options section allow you to specify the processing options used by the OCR engine.

Recognizer Option

Description

Automatic zoning

Allows the OCR engine to automatically determine which page-parsing algorithm to use. Increases processing time.

Standard zoning

Uses the same zoning as ScanSoft OmniPage 12. This is the most accurate, but slowest, page parser.

Fast zoning

This is the fastest, but least accurate, page parser. Use only for simple page-parsing tasks (that is, pages with mostly text and few graphics).

Legacy zoning

Uses the original zoning algorithm implemented with the OCR engine, which is intended for simple page decomposition. This is faster, but less accurate, than Standard zoning.

Nongridded table detect

Detects tables that have no grid lines and preserves their formatting. Increases processing time.

Force single column

Converts multiple columns on a single page into a single column.

Document-at-once method

Sends the document to the OCR engine all at once instead of page by page.

Note:

This option must be deselected in order to OCR color images.

Note:

Certain documents have been found to be incompatible with the Document-at-once method option. If the OCR process repeatedly fails when attempting to OCR the same types of documents, deselect this option.

Tip:

Select this option if you receive an error when attempting to output to a searchable PDF format.

Direct output (faster)

Speeds up an OCR process that is configured to create any of the following output formats:

  • ASCII text (standard or formatted)
  • PDF (image with searchable text)
  • Unicode text (standard or formatted)

The Direct output (faster) option is not available when any of the following Output format options is selected:

  • PDF (Standard)
  • PDF (Image Substitutes)
  • HTML 3.2
  • HTML 4.0
  • Microsoft Word 2003 (DOC)
  • Microsoft Word 2007 (DOCX)
  • Rich Text Format

Note:

Certain documents have been found to be incompatible with the Direct output (faster) option. If the OCR process repeatedly fails when attempting to OCR the same types of documents, deselect this option.
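
The availability rule for the Direct output (faster) option can be summarized as a simple lookup. The following is a minimal sketch only, not an OnBase API: the function name direct_output_available and both sets are hypothetical, and the format names are plain strings copied from the two lists above.

```python
# Hypothetical model of the Direct output (faster) rules; this is not an OnBase API.
# The format names are copied verbatim from the two lists in this section.
SPEEDS_UP = frozenset({
    "ASCII text (standard or formatted)",
    "PDF (image with searchable text)",
    "Unicode text (standard or formatted)",
})

NOT_AVAILABLE = frozenset({
    "PDF (Standard)",
    "PDF (Image Substitutes)",
    "HTML 3.2",
    "HTML 4.0",
    "Microsoft Word 2003 (DOC)",
    "Microsoft Word 2007 (DOCX)",
    "Rich Text Format",
})


def direct_output_available(output_format: str) -> bool:
    """Return True if Direct output (faster) can be selected for this format."""
    if output_format in NOT_AVAILABLE:
        return False
    if output_format in SPEEDS_UP:
        return True
    # Formats that appear in neither list are outside the scope of this sketch.
    raise ValueError(f"Unknown output format: {output_format!r}")
```

Even when the option is available, it may still need to be deselected for documents that repeatedly fail, as described in the note above.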

Most accurate

Most accurate, but slowest, recognition.

Balance accuracy / speed

Recognition with mid-level accuracy and mid-level speed.

Fastest

Fastest, but least accurate, recognition.

Tip:

Due to improvements in the OCR engine used in OnBase 7.2 and later, OCR processing times may be slightly longer than in earlier versions of OnBase. To speed up a process configured to use an ASCII text output format, select the Direct output (faster) check box. To speed up any OCR process, select the Fast zoning radio button, the Fastest radio button, or both. Be aware, however, that selecting Fast zoning or Fastest may reduce the accuracy of the process.
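
The speed-related choices in this tip can be illustrated with a small settings model. This is a sketch under stated assumptions, not an OnBase API: the RecognizerSettings class, its field names, and the tune_for_speed function are hypothetical, and the string values simply mirror the option labels in this section. The set of formats that support Direct output (faster) is taken from the list given for that option.

```python
from dataclasses import dataclass, replace

# Hypothetical: formats listed in this section as supporting Direct output (faster).
DIRECT_OUTPUT_FORMATS = frozenset({
    "ASCII text (standard or formatted)",
    "PDF (image with searchable text)",
    "Unicode text (standard or formatted)",
})


@dataclass(frozen=True)
class RecognizerSettings:
    """Hypothetical container for the options described in this section."""
    zoning: str          # e.g. "Automatic zoning", "Standard zoning", "Fast zoning"
    recognition: str     # "Most accurate", "Balance accuracy / speed", or "Fastest"
    direct_output: bool  # state of the Direct output (faster) check box


def tune_for_speed(settings: RecognizerSettings, output_format: str) -> RecognizerSettings:
    """Return a copy of the settings that favors speed over accuracy."""
    return replace(
        settings,
        zoning="Fast zoning",
        recognition="Fastest",
        # Direct output is only selectable for the formats listed above.
        direct_output=output_format in DIRECT_OUTPUT_FORMATS,
    )


current = RecognizerSettings("Standard zoning", "Most accurate", False)
faster = tune_for_speed(current, "ASCII text (standard or formatted)")
```

The function returns a new object and leaves the original settings unchanged, so the more accurate configuration stays available if the faster one reduces accuracy too much.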