Format Retention - Full-Text Search - English - Foundation 22.1 - OnBase - external


Format retention is the level of formatting retained in the document. The Format Retention drop-down is active for all documents except ASCII Text (Standard) documents.

The following retention options are available:

  • True Page: keeps the original layout of the pages.

  • Formatted Text: retains the formatting information for fonts and paragraphs, but ignores layout formatting.

  • Plain Text: ignores all formatting information.

  • Flowing Page: preserves the original layout of the pages, including column layouts.

Respect lines per page

When this option is selected and the number of lines of text on a page exceeds the Document Type's Lines per page setting, the excess lines are displayed in the right margin of the current page instead of being moved to the next page of the text document. This keeps a one-to-one correspondence between the pages of the image document and the pages of the text document.

If this option is not selected, text documents adhere to the Lines per page setting, pushing the excess lines of text to the next page of the text document.
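
The difference between the two behaviors can be illustrated with a minimal sketch. This is not OnBase code: the function name build_text_pages and its parameters are hypothetical, and the sketch assumes that overflow lines start a new text page and that each image page begins on a fresh text page. It models only which lines land on which page, not how the right margin is rendered.

```python
from typing import List


def build_text_pages(
    image_pages: List[List[str]],
    lines_per_page: int,
    respect_lines_per_page: bool,
) -> List[List[str]]:
    """Return the text-document pages produced from OCRed image pages.

    Hypothetical model of the "Respect lines per page" option.
    """
    if lines_per_page < 1:
        raise ValueError("lines_per_page must be at least 1")

    text_pages: List[List[str]] = []
    for lines in image_pages:
        if respect_lines_per_page or len(lines) <= lines_per_page:
            # Option selected (or no overflow): every line stays on the
            # text page that corresponds to its image page.
            text_pages.append(list(lines))
        else:
            # Option not selected: excess lines are pushed to following
            # text pages (assumed: in chunks of lines_per_page).
            for start in range(0, len(lines), lines_per_page):
                text_pages.append(lines[start:start + lines_per_page])
    return text_pages


# Two image pages; the first has one line more than the limit of 2.
pages = [["a1", "a2", "a3"], ["b1"]]
selected = build_text_pages(pages, 2, respect_lines_per_page=True)
not_selected = build_text_pages(pages, 2, respect_lines_per_page=False)
print(len(selected), len(not_selected))  # prints "2 3"
```

With the option selected, the text document has the same page count as the image document. With it cleared, the extra line forces a third text page, so page numbers no longer line up between the two documents.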

Do not OCR PDF documents

When this option is selected, the OCR process skips PDF documents in batches.

Create PDF/A compatible output

If the output format is PDF, select this check box to create PDF documents that conform to the PDF/A standard, then choose a conformance level:

  • To create PDF documents that conform to PDF/A-3a, select 3a.

  • To create PDF documents that conform to PDF/A-3u, select 3u.

Overwrite Revisionable Rendition

When this option is selected, if the Document Type is configured to allow revisions and the document being OCRed already has a rendition in the desired OCR output format, the new rendition overwrites the existing OCR rendition. When the document is opened, the original image document remains the primary display document.

In effect, selecting this option instructs the OCR engine to treat the Document Type as if it were not configured to allow revisions.
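
The decision described above can be sketched as a small function. This is an illustrative model only, not the OCR engine's implementation: the names RenditionAction and decide_rendition_action are hypothetical. The source does not state what happens when the option is cleared on a revisionable Document Type, so the ADD_REVISION outcome is an assumption, as is the CREATE outcome when no rendition exists yet.

```python
from enum import Enum


class RenditionAction(Enum):
    CREATE = "create a new rendition"
    OVERWRITE = "overwrite the existing rendition"
    ADD_REVISION = "keep the existing rendition and add a revision"  # assumed


def decide_rendition_action(
    allows_revisions: bool,
    has_existing_rendition: bool,
    overwrite_selected: bool,
) -> RenditionAction:
    """Hypothetical model of the Overwrite Revisionable Rendition option."""
    if not has_existing_rendition:
        # Nothing to overwrite (assumed behavior).
        return RenditionAction.CREATE

    # Selecting the option makes the engine treat the Document Type as if
    # it were not configured to allow revisions.
    treated_as_revisionable = allows_revisions and not overwrite_selected

    if treated_as_revisionable:
        # Assumption: the source does not describe this case explicitly.
        return RenditionAction.ADD_REVISION
    return RenditionAction.OVERWRITE


print(decide_rendition_action(True, True, True).name)  # prints "OVERWRITE"
```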