Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Panel
titleOn this page

Table of Contents

Step 1. Launch Aspire and open the Content Source Management Page

Step 2. Add a new Content Source

Step 3. Add a new Parquet Extractor to the Workflow

  • Step 3a. Specify Application Information
  • Step 3b. Share rule to a library
  • Step 3c. Copy the rule from the shared library

    Image Modified
    Anchor
    Step1
    Step1
    Step 1. Launch Aspire and

    open

    Open the Content Source Management Page

    Launch Aspire (if it's not already running). See:

    Anchor
    Step2
    Step2
    Step 2. Add a new Content Source

    • For this step please follow the step from the Configuration Tutorial of the connector of you choice, please refer to Connector list


    Anchor
    Step3
    Step3
    Step 3. Add a new Parquet Extractor to the Workflow

    To add a Parquet Extractor application drag from the Parquet Extractor rule from the Workflow Library and drop to the Workflow Tree where you want to add it. This will automatically open the Parquet Extractor window for the configuration of the application.


    Anchor
    Step3a
    Step3a
    Step 3a. Specify Application InformationImage Modified

     In the Parquet Extractor window, specify the configuration information of the application.

    1. General configuration
      1. No ids will be stored in NoSql - Check if you do not want to store ids in NoSql. Warning! Aspire will not be able to delete items in the index in future crawls.
      2. NoSQL Bulk Size - The size of the bulk write operations done to the NoSql Database
      3. NoSQL Bulk Timeout - The amount of time to wait before flushing the bulk operations after the last insert
      4. No info messages - Check if you want info messages disabled
      5. Sub Job timeout.
        1. Time in milliseconds to wait before the current job is killed for inactivity.
        2. Example: 60000
      6. Debug:
        1. Enable debug messages.
    2. Routing
      1. Workflow for add/update jobs:
        1. Workflow to send the generated add or update jobs.
      2. Workflow for delete jobs:
        1. Workflow to send the generated delete jobs.
      3. Workflow for error jobs:
        1. Workflow to send the generated error jobs.


    Image Modified
    Anchor
    Step3b
    Step3b
    Step 3b. Share rule to a library

    Once the application is set, it must be shared to a (new or existing) library.

    Note: This is a required step. 

    Anchor
    Step3c
    Step3c
    Step 3c. Copy the rule from the shared library

    Add the shared application from the library to the Delete workflow.

    Note: This is a required step. 


    Warning

    In order to work, the application requires to disable the ExtractText stage performed by the connector on the connector's Advanced Properties.

    Once you've clicked on the Add button, it will take a moment for Aspire to download all of the necessary components (the Jar files) from the Maven repository and load them into Aspire. Once that's done, the application will appear in the Workflow Tree.

    Info

    For details on using the Workflow section, please refer to Workflow introduction.