Step 3b. Specify the executor configuration.

Root Node: The root node which contains the sub-jobs to publish. If not specified, the root node of the entire XML tree is considered to be the root node.
Character Encoding: The character encoding of the XML file to be read, if not UTF-8.
Cleanse: Enable to remove non-readable characters (such as ASCII code 15) from the XML content.
Honor DTD: Enable to fetch the XML file's DTD.
Limit nested structures to flatten: Enable to limit how many levels of a nested structure are flattened.
1. Maximum nested level: The maximum nesting level to flatten.
Limit the number of arrays entries to process: Enable to limit how many entries in array structures should be processed.
1. Maximum number of entries: The maximum number of array entries to process.
Debug: Enable to log debug messages.
Add Schema: If enabled, the table schema will be added to the processed columns.
Use Temp File: Enable to download the content stream to a temporary file before processing it.
Thread Pool: The number of threads to use for parallel processing.
Processed Rows Log Frequency: The frequency for reporting the processed rows.
Use row sampling: Enable to process only a random sample of the table rows. This option can increase memory usage.
1. Minimum of samples to gather: The minimum number of random samples to gather from the table.
2. Maximum of samples to gather: The maximum number of random samples to gather from the table.
3. Minimum percentage to process: The minimum percentage of the table's total rows to process.
4. Limit the number of rows to read: Enable to limit how many rows from the table will be read.
  1. Maximum number of rows to read: The maximum number of rows to read from the table.
Use row filter: Check to filter the rows to process.
1. Use groovy file: Enable to use a groovy file to filter the rows.
  1. Groovy Script Path: The path of the groovy script that contains the filter logic. The script must return a boolean value; if it returns true, the row is filtered.
  2. Filter Script: The script used to filter the rows. It must return a boolean value; if it returns true, the row is filtered.
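
To make the "Limit nested structures to flatten" and "Limit the number of arrays entries to process" options more concrete, the following Python sketch shows one way such limits could behave. It is not the executor's implementation. The function `flatten`, its parameters `max_level` and `max_entries`, the dotted column names, and the sample `record` (standing in for a parsed XML document) are all hypothetical. The sketch assumes that structures below the maximum nested level are kept as a single unflattened value and that only the first entries of an array are processed. The actual executor may behave differently.

```python
from typing import Any, Dict, Optional


def flatten(
    node: Any,
    max_level: Optional[int] = None,
    max_entries: Optional[int] = None,
    prefix: str = "",
    level: int = 0,
) -> Dict[str, Any]:
    """Flatten nested dicts and lists into one dict keyed by dotted names."""
    # Scalars always become a single column value.
    if not isinstance(node, (dict, list)):
        return {prefix: node}
    # Assumption: once the maximum nested level is reached,
    # the remaining structure is kept as one unflattened value.
    if max_level is not None and level >= max_level:
        return {prefix: node}
    if isinstance(node, dict):
        items = list(node.items())
    else:
        # Assumption: only the first max_entries array entries are processed.
        entries = node if max_entries is None else node[:max_entries]
        items = [(str(index), value) for index, value in enumerate(entries)]
    flat: Dict[str, Any] = {}
    for key, value in items:
        name = f"{prefix}.{key}" if prefix else str(key)
        flat.update(flatten(value, max_level, max_entries, name, level + 1))
    return flat


# Hypothetical sample data standing in for a parsed XML document.
record = {
    "order": {
        "id": 7,
        "customer": {"name": "Ann", "address": {"city": "Oslo"}},
        "items": ["a", "b", "c"],
    }
}

unlimited = flatten(record)
limited = flatten(record, max_level=3, max_entries=2)
```

With no limits, every leaf becomes its own column, for example `order.customer.address.city`. With `max_level=3` and `max_entries=2`, the `address` structure stays as a single value under `order.customer.address`, and only the first two `items` entries appear.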