Create Parquet Summarizer Executor

Field	Required	Default	Multiple	Notes	Example
type	Yes	-	No	The value must be "application".	"application"
_type	Yes	-	No	The value must be "application".	"application"
appName	Yes	-	No	The name of the application	"Parquet-Executor"
appType	Yes	-	No	The value must be "parquet-summarize-executor".	"parquet-summarize-executor"
config	Yes	-	No	The value must be "com.accenture.aspire:app-parquetsummarize-executor".	"com.accenture.aspire:app-parquetsummarize-executor"
description	Yes	-	No	The description	"Parquet-Executor"
properties	Yes	-	No	Configuration object
addSchema	Yes	true	No	If enabled, the table schema will be added to the processed columns.	true
useTempFile	Yes	true	No	Enable to download the content stream to a temporary file before processing it.	true
threadPool	Yes	5	No	The number of threads to use for parallel processing.	5
logFrequency	Yes	1000	No	The frequency for reporting the processed rows.	1000
filterRows	Yes	false	No	Enable to filter the rows to process.	true
useFilterFile	Yes	true	No	Enable to use a groovy file to filter the rows.	true
groovyPath	No	-	No	The path of the groovy script that contains the filter logic. It must return a boolean value. If true, the row will be filtered.	"C:\\Aspire\\config\\rowsGroovyFilter.txt"
groovyScript	No	-	No	Script used to filter the rows. It must return a boolean value. If true, the row will be filtered.	"row.getBoolean(\"sensitive\") == true"

Example

POST /aspire/_api/workflows/{workflow}/rules

{
  "type": "application",
  "_type": "application",
  "description": "Parquet-Executor",
  "config": "com.accenture.aspire:app-parquetsummarize-executor",
  "appType": "parquet-summarize-executor",
  "appName": "Parquet Summarize Executor",
  "properties": {
    "addSchema": true,
    "useTempFile": true,
    "debug": false,
    "threadPool": 5,
    "logFrequency": 1000,
    "filterRows": true,
    "useFilterFile": false,
    "groovyScript": "// This script must return a boolean.\n// The references of the job, doc, component, row and table objects are available.\n// Javadoc references \n// Row (row) - http://{manager}/javadocs/com/accenture/aspire/services/summarization/Row.html\n// Table (table) - http://{manager}/javadocs/com/accenture/aspire/services/summarization/Table.html\nrow.getBoolean(\"sensitive\") == true"
  }
}

Update Parquet Summarizer Executor

Field	Required	Default	Multiple	Notes	Example
id	Yes	-	No	ID of the application to update	"61014782-442a-4587-ab85-ba1439a7f7b5"
type	Yes	-	No	The value must be "application".	"application"
_type	Yes	-	No	The value must be "application".	"application"
appName	Yes	-	No	The name of the application	"Parquet-Executor"
appType	Yes	-	No	The value must be "parquet-summarize-executor".	"parquet-summarize-executor"
config	Yes	-	No	The value must be "com.accenture.aspire:app-parquetsummarize-executor".	"com.accenture.aspire:app-parquetsummarize-executor"
description	Yes	-	No	The description	"Parquet-Executor"
properties	Yes	-	No	Configuration object
addSchema	Yes	true	No	If enabled, the table schema will be added to the processed columns.	true
useTempFile	Yes	true	No	Enable to download the content stream to a temporary file before processing it.	true
threadPool	Yes	5	No	The number of threads to use for parallel processing.	5
logFrequency	Yes	1000	No	The frequency for reporting the processed rows.	1000
filterRows	Yes	false	No	Enable to filter the rows to process.	true
useFilterFile	Yes	true	No	Enable to use a groovy file to filter the rows	true
groovyPath	No	-	No	The path of the groovy script that contains the filter logic. It must return a boolean value. If true, the row will be filtered.	"C:\\Aspire\\config\\rowsGroovyFilter.txt"
groovyScript	No	-	No	Script used to filter the rows. It must return a boolean value. If true, the row will be filtered.	"row.getBoolean(\"sensitive\") == true"

Example

PUT /aspire/_api/workflows/{workflow}/rules/{id}

{
  "id": "61014782-442a-4587-ab85-ba1439a7f7b5", 
   "type": "application",
  "_type": "application",
  "description": "Parquet-Executor",
  "config": "com.accenture.aspire:app-parquetsummarize-executor",
  "appType": "parquet-summarize-executor",
  "appName": "Parquet Summarize Executor",
  "properties": {
    "addSchema": true,
    "useTempFile": true,
    "debug": false,
    "threadPool": 5,
    "logFrequency": 1000,
    "filterRows": true,
    "useFilterFile": false,
    "groovyScript": "// This script must return a boolean.\n// The references of the job, doc, component, row and table objects are available.\n// Javadoc references \n// Row (row) - http://{manager}/javadocs/com/accenture/aspire/services/summarization/Row.html\n// Table (table) - http://{manager}/javadocs/com/accenture/aspire/services/summarization/Table.html\nrow.getBoolean(\"sensitive\") == true"
  }
}

Page tree

Create Parquet Summarizer Executor

Example

Update Parquet Summarizer Executor

Example

Contact Us: [email protected]

Page tree

Rest API - Parquet Summarizer Executor

Create Parquet Summarizer Executor

Example

Update Parquet Summarizer Executor

Example

Contact Us: [email protected]