OCR Activities - Hyland RPA Designer - Foundation 24.1 - 2025-04-03

Sometimes an application cannot be indicated with the available automation providers. In this case, OCR activities provide an alternative approach.

OCR stands for Optical Character Recognition. It reads text from images without directly accessing the underlying text field. Beyond reading text, OCR can also search an image for specific text and click or hover over it. In addition, OCR can wait for specific text to appear or disappear.

Activity                    Description
Get Text by Anchor          Gets the text of a defined area relative to an anchor.
OCR Click                   Clicks the specified text.
Hover Text                  Hovers over the specified text.
Wait Text Appear            Waits a defined time for certain text to appear (can serve as the basis for a decision).
Wait Text Vanish            Waits a defined time for certain text to vanish (can serve as the basis for a decision).
Get Text From PDF with OCR  Returns the text of a PDF by using optical character recognition.
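
The waiting behavior described for Wait Text Appear and Wait Text Vanish can be pictured as a polling loop. The following Python sketch is only an illustration of that idea and is not Hyland RPA code: the function wait_for_text, its parameters, and the read_text callable (standing in for "capture the screen and run OCR on it") are all assumptions made for this example.

```python
import time
from typing import Callable


def wait_for_text(
    read_text: Callable[[], str],
    target: str,
    timeout_s: float = 5.0,
    interval_s: float = 0.25,
    want_present: bool = True,
) -> bool:
    """Poll read_text() until target is present (or absent) or the timeout expires.

    read_text is a hypothetical stand-in for an OCR pass over the screen.
    Returns True if the desired state was reached in time and False otherwise,
    so the caller can branch on the result.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        # Case-insensitive substring match on the recognized text.
        present = target.lower() in read_text().lower()
        if present == want_present:
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


# Usage with a fake "screen" that changes after two reads.
frames = iter(["Loading...", "Loading...", "Invoice saved"])
state = {"text": ""}


def fake_read() -> str:
    # Keep returning the last frame once the iterator is exhausted.
    state["text"] = next(frames, state["text"])
    return state["text"]


appeared = wait_for_text(fake_read, "invoice saved", timeout_s=2.0, interval_s=0.01)
vanished = wait_for_text(fake_read, "loading", timeout_s=2.0, interval_s=0.01, want_present=False)
```

The boolean return value mirrors the "basis for a decision" role in the table: a workflow can take one branch when the text shows up in time and another when the timeout is reached.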

Every OCR activity lets you change the OCR Engine activity it uses. The engine performs the actual text recognition.

Hyland RPA provides the Tesseract Engine activity within the OCR activities by default. However, the user is free to choose a different (custom) OCR engine.
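
The split between an OCR activity and its exchangeable engine can be sketched as follows. This is a conceptual Python illustration under stated assumptions, not the Hyland RPA or Tesseract API: OcrEngine, Word, FakeEngine, and find_click_point are hypothetical names, and the fake engine returns canned results instead of running real recognition.

```python
from dataclasses import dataclass
from typing import List, Optional, Protocol, Tuple

Box = Tuple[int, int, int, int]  # left, top, width, height in image pixels


@dataclass(frozen=True)
class Word:
    text: str
    box: Box


class OcrEngine(Protocol):
    """Hypothetical engine contract: turn an image into recognized words."""

    def recognize(self, image: bytes) -> List[Word]:
        ...


class FakeEngine:
    """Stand-in engine with canned output; a real engine would run OCR here."""

    def __init__(self, words: List[Word]) -> None:
        self._words = list(words)

    def recognize(self, image: bytes) -> List[Word]:
        return list(self._words)


def find_click_point(
    engine: OcrEngine, image: bytes, target: str
) -> Optional[Tuple[int, int]]:
    """Activity side: hand the image to the engine, then act on its result.

    Returns the centre of the first word matching target, or None if no
    word matches.
    """
    for word in engine.recognize(image):
        if word.text.lower() == target.lower():
            left, top, width, height = word.box
            return (left + width // 2, top + height // 2)
    return None


engine = FakeEngine([
    Word("Cancel", (10, 20, 60, 20)),
    Word("Submit", (100, 20, 60, 20)),
])
point = find_click_point(engine, b"", "submit")
```

Because find_click_point depends only on the recognize method, any object providing that method could be swapped in without changing the activity logic, which is the idea behind choosing a different engine.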

The OCR activity itself is responsible for providing the engine with the image from which to read the text, and for performing the desired action. Below you can see the OCR Click activity with and without the Tesseract Engine activity.

Anchor Image

In every OCR activity, the user must define two regions of the provided image: an anchor region and the region where the text is expected. The following screenshot demonstrates this methodology:

The upper dotted line marks the anchor image, which the OCR activity searches for first (here, the HYLAND image). The lower dotted line marks the area to be read with OCR.
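
One way to think about the relationship between the two regions is that the text region is stored as an offset from the anchor, so it can be located again wherever the anchor is found. The Python sketch below illustrates that geometry only. It assumes the region is relative to the anchor's top-left corner, which this page does not confirm, and Rect, RelativeRegion, and resolve_text_region are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    left: int
    top: int
    width: int
    height: int


@dataclass(frozen=True)
class RelativeRegion:
    """Text region expressed as an offset from the anchor's top-left corner (assumption)."""

    dx: int
    dy: int
    width: int
    height: int


def resolve_text_region(anchor_found_at: Rect, region: RelativeRegion) -> Rect:
    """Translate the anchor-relative region into absolute screen coordinates."""
    return Rect(
        left=anchor_found_at.left + region.dx,
        top=anchor_found_at.top + region.dy,
        width=region.width,
        height=region.height,
    )


# The text area sits 60 px below the anchor and is 300 x 30 px.
text_area = RelativeRegion(dx=0, dy=60, width=300, height=30)

# The same definition resolves correctly even if the window has moved.
first = resolve_text_region(Rect(200, 100, 120, 40), text_area)
moved = resolve_text_region(Rect(500, 300, 120, 40), text_area)
```

Under this assumption, only the anchor has to be found by image search at run time; the area to read follows from simple arithmetic.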