Search: extract-structured-data
Last modified by admin on 2022/04/24 04:58
[3] Configure Fields for Data Extraction
Each Pipeline defines the structure of Data fields that akaBot Vision extracts. Description: When editing this structure you have two options: Use pre-trained Data fields – akaBot Vision’s Generic AI engine has been pre-trained to recognize specific Data fields and enables you to start extracting data without any additional training…
[1] Create an Account
1. Create an Account. Note: Although akaBot Vision currently supports pre-trained data fields only for Invoice processing, the technology is document-agnostic and can extract data from any structured document, including receipts, purchase orders, shipping documents, etc. Please contact support…
[4] Capture Custom Table Data
A basic element in the extraction schema is the data field. However, akaBot Vision enables the capture of even more complex structures like tables. Adding a predefined table field: If you are missing
…settings. In this tab, you can manage pre-trained data fields and select which of them should be extracted.
[3] Validate the Data
When you select a document for review, it will take you to the Validation screen. In the left panel, you can see the predefined list of Data fields that have been extracted. Press Tab to check
…otherwise, you won’t be able to export the data. If you realize that the data field was not extracted correctly, you can make
Get Event Info
RCA.Activities.Core.GetEventInfo Description: This activity allows extracting different types of information
…(* for Mandatory) Properties: Misc: Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful: consider data security before using it. Display Name (String) - The name
…of the activity to organize and structure your code better. E.g.: Get Event Info. Type Argument (Dropdown list
Read Text
…from continuing the execution. Misc: Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful: consider data security before using it. Display Name (String) - The name of this activity. You can edit the name of the activity to organize and structure your code better. E.g.: Read Word File. Output: Text (String) - Context or specified characters that are extracted from the file and stored in a String variable.
[2] Upload Document
…Click “OK” and the AI will automatically begin extracting the data fields. The AI extraction typically takes up to 5 seconds for one A4 page. After processing…
Get Table
…of this activity. You can edit the name of the activity to organize and structure your code better. E.g.: Get the data table. Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful: consider data security before using it. Output: Data Table (DataTable) - The table is extracted from the Word document and stored…
Execute XPath
…This field supports only strings and string variables. Misc: Public (Checkbox) - If you check it, the data of this activity will be shown in the log. Be careful: consider data security before using it. Display Name (String) - The name of this activity. You can edit the name of the activity to organize and structure your code better. E.g.: Execute XPath
Get Elements
RCA.Activities.IE.GetElements Description: Extracts the UI element on Internet Explorer. Activity is only valid
…Check if you want to publicize it. Remember to consider data security requirements before using it. Display Name (String) - The name of this activity. You can edit the name of the activity to organize and structure your code better. Ex: [2424234] Get Elements. Output: Elements (IEElement