Get Text from PDF

Hyland RPA Activities

Platform: Hyland RPA
Product: Activities
Release: Foundation 23.2
License

Applications.Zip

Description

Returns the text of a PDF file.

Properties

Analyst

  • Description: Optional text for documentation purposes.

Common

  • Continue On Error: Continues the execution of the workflow even if an error occurs during the execution of the activity.

  • Delay Before: Delay time in milliseconds before the activity is executed. The default value is 250 milliseconds.

  • Delay After: Delay time in milliseconds after the activity is executed. The default value is 250 milliseconds.

  • Disable Log: Disables the logging functionality for this activity.

  • Disable Protocol: Disables the protocol functionality for this activity.

  • Exclude from Global Delay: Excludes this activity from the Global Delay functionality.

  • Timeout: Maximum duration in milliseconds for which the activity tries to execute.

Input

  • Pages: Enter the pages you want to read. For example, "1" reads the first page of the document, "1-4" reads pages 1, 2, 3, and 4, and "2,4" reads pages 2 and 4 only. Combinations such as "1,4-6" are also possible. Leave blank to read all pages.
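
The Pages syntax can be illustrated with a short Python sketch. This is not the activity's implementation; the function name parse_pages and its page_count parameter are hypothetical, and the validation rules (1-based pages, ascending ranges, no duplicates) are assumptions made only to show how a value such as "1,4-6" expands.

```python
def parse_pages(spec: str, page_count: int) -> list[int]:
    """Expand a Pages string such as "1,4-6" into 1-based page numbers.

    Hypothetical helper for illustration; the activity's own parsing
    rules may differ.
    """
    spec = spec.strip()
    if not spec:
        # A blank value means "read all pages".
        return list(range(1, page_count + 1))

    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # A range such as "4-6" covers both end points.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            # A single page such as "2".
            start = end = int(part)
        if start < 1 or end < start or end > page_count:
            raise ValueError(f"invalid page range: {part!r}")
        for page in range(start, end + 1):
            if page not in pages:  # assumption: duplicates are ignored
                pages.append(page)
    return pages


print(parse_pages("1,4-6", 10))  # [1, 4, 5, 6]
```

In this sketch a blank string returns every page from 1 to page_count, which mirrors the "leave blank to read all pages" rule above.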

Output

  • All Pages Are Readable: Returns false if at least one of the specified pages is not readable.
  • Content: Returns the text content of all specified pages.
  • Forms Content: Returns the values set in PDF form objects. PDF forms are input fields that can't be read directly like normal text. To get a value, iterate over the Dictionary output with For Each KeyValuePair.
    Note: The input can be null.
  • Page Content: Stores the text content for each individual page.
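
As a rough analogy for reading Forms Content, the Python sketch below walks a dictionary of form values key by key, much as a For Each KeyValuePair loop would in a workflow. The function name form_values and the field names in the example are hypothetical, and treating a missing dictionary or a missing value as null is an assumption based on the note above, not documented behavior.

```python
from typing import Optional


def form_values(forms_content: Optional[dict[str, Optional[str]]]) -> list[str]:
    """Return one "name = value" line per form field.

    Hypothetical helper: a None dictionary or a None value is treated
    as "nothing entered" (an assumption, not documented behavior).
    """
    lines: list[str] = []
    # Fall back to an empty dictionary so a null output does not fail.
    for name, value in (forms_content or {}).items():
        shown = value if value is not None else "<empty>"
        lines.append(f"{name} = {shown}")
    return lines


# Example with made-up field names.
for line in form_values({"FirstName": "Ada", "Signature": None}):
    print(line)
```

Guarding against null before the loop keeps the iteration from failing when a PDF contains no form fields.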