The Avro Summarizer Executor is configured in the Aspire Admin UI: open the Workflow page and click the workflow in which the component will be used.


Step 1. Launch Aspire and Open the Content Source Management Page.

Launch Aspire (if it's not already running). See:


Step 2. Add or select a Workflow.

  • Add a new workflow or open an existing workflow.
  • For this step, please refer to the Workflow Introduction.


Step 3. Add the Avro Summarizer Executor to the Workflow.

  • From the Event combo, select the event to which you want to add the Avro Summarizer Executor.
  • To add the component, drag the Avro Summarizer Executor from the Rules Section on the right side of the screen and drop it below the Workflow Event on the left side of the screen. This automatically opens the Avro Summarizer Executor configuration window.

Step 3a. Specify a description for the application.

 In the top section of the Avro Summarizer Executor configuration window, specify the description for the application.


Step 3b. Specify the executor configuration.

  1. Add Schema: If enabled, the table schema will be added to the processed columns.
  2. Debug: If checked, debug messages will be enabled.
  3. Thread Pool: The number of threads to use for parallel processing.
  4. Processed Rows Log Frequency: The frequency for reporting the processed rows.
  5. Use row sampling: Enable to process only a random sample of the table rows. This option may increase memory usage.
    1. Minimum of samples to gather: The minimum number of random samples to gather from the table.
    2. Maximum of samples to gather: The maximum number of random samples to gather from the table.
    3. Minimum percentage to process: The minimum percentage of the total rows to process from the table.
    4. Limit the number of rows to read: Enable to limit how many rows are read from the table.
      1. Maximum number of rows to read: The maximum number of rows to read from the table.
  6. Use row filter: Check to filter the rows to process.
    1. Use groovy file: Check to use a Groovy file as the filter.
      1. Use Local Script File: Check to specify a local script file, or uncheck to specify an uploaded resource file.
        1. Groovy Script Path: The path of the Groovy script that contains the filter logic. The script must return a boolean value; if it returns true, the row will be filtered.
      2. Filter Script: The script used to filter the rows. It must return a boolean value; if it returns true, the row will be filtered.
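
The page does not say how the row sampling settings combine, so the following Python sketch shows only one plausible reading, intended to help reason about values before entering them in the UI. It assumes that the sample size is the minimum percentage of the rows read, clamped between the minimum and maximum number of samples, and that the row limit is applied before sampling. The function and parameter names (sample_size, sample_rows, max_rows_to_read, and so on) are hypothetical and are not part of Aspire.

```python
import math
import random


def sample_size(total_rows, min_samples, max_samples, min_percentage):
    """Assumed rule (not confirmed by the Aspire documentation):
    take min_percentage of the rows, clamp the result to
    [min_samples, max_samples], and never exceed total_rows."""
    by_percentage = math.ceil(total_rows * min_percentage / 100)
    target = max(min_samples, min(by_percentage, max_samples))
    return min(target, total_rows)


def sample_rows(rows, min_samples, max_samples, min_percentage,
                max_rows_to_read=None, seed=None):
    """Pick a random sample of rows under the assumed rule above."""
    # Assumption: the row limit is applied before sampling.
    if max_rows_to_read is not None:
        rows = rows[:max_rows_to_read]
    size = sample_size(len(rows), min_samples, max_samples, min_percentage)
    # A seeded generator keeps the example reproducible.
    return random.Random(seed).sample(rows, size)


if __name__ == "__main__":
    print(sample_size(10_000, 100, 1_000, 5))    # 500: 5% of the rows
    print(sample_size(100_000, 100, 1_000, 5))   # 1000: capped by the maximum
    print(sample_size(50, 100, 1_000, 5))        # 50: table smaller than the minimum
    picked = sample_rows(list(range(10_000)), 100, 1_000, 5,
                         max_rows_to_read=2_000, seed=1)
    print(len(picked))                           # 100: 5% of the 2000 rows read
```

Under this reading, a low maximum of samples bounds the work done on very large tables, while the minimum keeps small tables from being under-sampled.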




