Full-Text Renditions Do Not OCR Correctly - Full-Text Search - English - Foundation 22.1 - OnBase - external

The OCR engine used by Full-Text Search is optimized for best results. OCR format settings for the Document Types being indexed can be configured in the OnBase Configuration module.

Note:

The OCR process is only performed for documents indexed with pagination enabled. Text documents do not use an OCR format on indexing because text is always indexed using a proprietary text parser.

If no other OCR settings are configured for a Document Type being indexed, Full-Text Search uses the following settings:

  • Full Text (Index) <Default>: The settings optimized for indexing documents.

  • Full Text (View) <Default>: The settings optimized for viewing documents.

The following OCR format settings are not supported by Full-Text Search:

  • Output Format

  • Do not OCR PDF documents

  • Create PDF/A compatible output

  • Report per-page timing statistics
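
The fallback and unsupported-setting rules above can be modeled in a short sketch. This is an illustration only, not an OnBase API: the class, function, and dictionary key names (OcrFormat, resolve_format, effective_settings, "Language") are hypothetical, and treating "not supported" as "ignored at indexing time" is an assumption. Only the two default format names and the four unsupported setting names come from this topic.

```python
from dataclasses import dataclass, field
from typing import Optional

# Default OCR formats named in this topic, keyed by a hypothetical "purpose".
DEFAULT_FORMATS = {
    "index": "Full Text (Index) <Default>",
    "view": "Full Text (View) <Default>",
}

# OCR format settings that this topic lists as not supported by Full-Text Search.
UNSUPPORTED_SETTINGS = {
    "Output Format",
    "Do not OCR PDF documents",
    "Create PDF/A compatible output",
    "Report per-page timing statistics",
}


@dataclass
class OcrFormat:
    """Hypothetical stand-in for an OCR format configured on a Document Type."""
    name: str
    settings: dict = field(default_factory=dict)


def resolve_format(configured: Optional[OcrFormat], purpose: str) -> OcrFormat:
    """Return the configured format, or the default format for the purpose."""
    if configured is not None:
        return configured
    return OcrFormat(name=DEFAULT_FORMATS[purpose])


def effective_settings(fmt: OcrFormat):
    """Split settings into honored ones and unsupported ones.

    Assumption: unsupported settings are simply ignored.
    """
    honored = {k: v for k, v in fmt.settings.items()
               if k not in UNSUPPORTED_SETTINGS}
    ignored = sorted(k for k in fmt.settings if k in UNSUPPORTED_SETTINGS)
    return honored, ignored


# Usage: a custom format with one supported and one unsupported setting.
custom = OcrFormat("Invoices OCR", {"Language": "German", "Output Format": "PDF"})
honored, ignored = effective_settings(resolve_format(custom, "index"))
print(honored)   # {'Language': 'German'}
print(ignored)   # ['Output Format']
print(resolve_format(None, "view").name)  # Full Text (View) <Default>
```

A check like this can be useful when reviewing a Document Type's OCR format: any setting that lands in the ignored list is a candidate explanation for output that differs from what the format appears to request.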

Language settings are respected. The default language is English, so when indexing non-English documents, change the default language to match the language of the documents being indexed.

It is a best practice to ensure that all documents are rotated for natural, top-down viewing. If the OCR format for the Document Type containing a rotated document does not have rotation enabled, search results may not be highlighted correctly.

Tip:

For complete details on adjusting the OCR settings, see the Configuring OCR Formats appendix in the Full-Text Search module reference guide.