Extract Structured Data

Last modified by Nga Nguyen on 2022/05/16 04:07

RCA.Activities.Common.ExtractStructuredData

Description

The Extract Structured Data allows you to extract structured data from a specified webpage.

image-20220505134517-1.png

(*For mandatory)

In the body of activity

  • Open Wizard* - Chooses the element to verify its existence. This activity will generate a string variable (Selector) to specify the location of that element.

Properties

Common

  • Continue On Error (Boolean) - This property specifies when the automation keeps going if it has an error. Only have two possible values: True or False. True - allows the rest of the process to continue the execution even an error occurs within the activity. False (default) - blocks the process from continuing the execution.
  • Delay After (Int32) - Delay time (in milliseconds) after executing the activity. The default amount of time is 300 milliseconds.
    E.g: 300
  • Delay Before (Int32) - Delay time (in milliseconds) before the activity begins performing any operations. The default amount of time is 200 milliseconds.
    E.g: 200

Input

  • Selector (String) - The outputted data with type 'DataTable'.

Misc

  • Display Name (String) - The name of this activity. You can edit the name of the activity to organize and structure your code better.
  • Public (Checkbox) - Check if you want to public it. Remember to consider data security requirement before using it. Default is uncheck.

Options

  • Delay Between Pages MS (Int32) – The amount of time (in milliseconds) to wait until the next page is loaded. The default value is 1000 milliseconds. If the loading time of the page is longer, this value should be higher.
    E.g: 1000
  • Max Number Of Result (Int32) – The maximum number of results to be extracted. If the value is 0, all the identified elements are added to the output. The default value is 100.
    E.g: 100
  • Next Page Timeout MS (Int32) - The amount of time (in milliseconds) to wait for the next page to load. If the timeout has expired, the page would not be loaded further. The default amount of time is 30000 milliseconds. If the input value is less than or equal to 0, the timeout will be 30000 (milliseconds).
    E.g: 30000

Output

  • Result (Data Table) - The outputted data with type = ‘DataTable’.
  
Navigation