Search: extract-structured-data
Last modified by admin on 2022/04/24 04:58
[17]Extract Structured Data
Located in
- Rendered document content
RCA.Activities.Browser.ExtractStructuredData Description The Extract Structured Data activity allows you to extract structured data from a specified webpage. (* for mandatory) In the body of the activity Pick
…allows you to input the page source containing the structured data to be extracted
- Title
[17]Extract Structured Data
- Location
[17]Extract Structured Data
- Raw document content
RCA.Activities.Browser.ExtractStructuredData Description The Extract Structured Data activity allows you to extract structured data from
…edit the name of the activity to organize and structure your code better. E.g: Extract Data
…(String) - This property allows you to input the page source containing the structured data needed
[3] Configure Fields for Data Extraction
Located in
- Rendered document content
Each Pipeline defines the structure of Data fields that akaBot Vision extracts. Description When editing this structure you have two options: Use pre-trained Data fields – akaBot Vision’s Generic AI engine has been pre-trained to recognize specific Data fields and enables you to start extracting data
- Title
[3] Configure Fields for Data Extraction
- Location
Customizing Data Extract
…Configuring Fields for Data Extraction
- Raw document content
Each Pipeline defines the structure of Data fields that akaBot Vision extracts. Description When editing this structure you have two
…to recognize specific Data fields and enables you to start extracting data without any additional training
[16]Extract Data
Located in
- Rendered document content
RCA.Activities.Browser.ExtractData Description The Extract Data activity allows you to get data
…file that enables you to extract data from the indicated webpage. The text must be quoted. E.g: "project.json
…Config Json (String) - JSON file that enables you to extract data from the indicated webpage. The text must
- Title
[16]Extract Data
- Location
[16]Extract Data
- Raw document content
…of the activity to organize and structure your code better. E.g: [342342314] Extract Data Output
…RCA.Activities.Browser.ExtractData Description The Extract Data activity allows you to get data from a specified
…file that enables you to extract data from the indicated webpage. The text must be quoted. E.g: "project.json
[1] Create an Account
Located in
- Rendered document content
1. Create an Account Note: Although akaBot Vision currently supports pre-trained data fields only for invoice processing, the technology is document agnostic and can extract data from any structured document, including receipts, purchase orders, shipping documents, etc. Please contact support
- Raw document content
…currently supports pre-trained data fields only for invoice processing, the technology is document agnostic and can extract data from any structured document, including receipts, purchase orders, shipping
…Step 2
[4] Capture Custom Table Data
Located in
- Rendered document content
A basic element in the extraction schema is the data field. However, akaBot Vision enables the capture of even more complex structures like tables. Adding a predefined table field If you are missing
…settings. In this tab, you can manage pre-trained data fields and select which of them should be extracted
- Title
[4] Capture Custom Table Data
- Location
Customizing Data Extract
…Capturing Custom Table Data in akaBot Vision
- Raw document content
A basic element in the extraction schema is the data field. However, akaBot Vision enables the capture of even more complex structures like tables. Adding a predefined table
…of them should be extracted.
[3] Validate the Data
Located in
- Rendered document content
When you select a document for review, it will take you to the Validation screen. In the left panel, you can see the predefined list of Data fields that have been extracted. Press Tab to check
…be able to export the data. If you realize that the data field was not extracted correctly, you can make
- Title
[3] Validate the Data
- Location
Validate the Data
- Raw document content
…otherwise, you won’t be able to export the data. If you realize that the data field was not extracted
…In the left panel, you can see the predefined list of Data fields that have been extracted.
Get Event Info
Located in
- Rendered document content
RCA.Activities.Core.GetEventInfo Description This activity allows extracting different types
…. (* for Mandatory) Properties Misc Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful, consider data security before using it. Display Name (String) - The name
- Raw document content
RCA.Activities.Core.GetEventInfo Description This activity allows extracting different types of information
…- If you check it, the data of this activity will be shown in the log. Be careful, consider data security
…of the activity to organize and structure your code better. E.g: Get Event Info Type Argument (Dropdown list
Read Text
Located in
- Rendered document content
from continuing the execution. Misc Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful, consider data security before using it. Display Name (String) - The name of this activity. You can edit the name of the activity to organize and structure your code better. E.g: Read Word
- Raw document content
check it, the data of this activity will be shown in the log. Be careful, consider data security before
…to organize and structure your code better. E.g: Read Word File Output Text (String) - Context or specified characters that are extracted from the file and stored in a String variable.
[2] Upload Document
Located in
- Rendered document content
extracting the data fields. The AI extraction typically takes up to 5 seconds for 1 A4 page. After
- Raw document content
…Click “OK” and the AI will automatically begin extracting the data fields. The AI extraction typically takes up to 5 seconds for 1 A4 page. After processing
[10]Get Text
Located in
- Rendered document content
RCA.Activities.Browser.GetText Description The Get Text activity extracts text on a webpage and saves
…to organize and structure your code better. E.g: [3454334] Get Text Public (Checkbox) - Check this if you want to make it public. Remember to consider data security requirements before using it. Default is unchecked. Output
- Raw document content
RCA.Activities.Browser.GetText Description The Get Text activity extracts text on a webpage and saves it in a String
…the name of the activity to organize and structure your code better. E.g: [3454334] Get Text Public (Checkbox) - Check this if you want to make it public. Remember to consider data security requirements before using