On this page

  • Step 1. Launch Aspire and open the Content Source Management Page
  • Step 2. Add a new Content Source
  • Step 3. Add a new Parquet Extractor to the Workflow
  • Step 3a. Specify Application Information
  • Step 3b. Share rule to a library
  • Step 3c. Copy the rule from the shared library
    Step 1. Launch Aspire and open the Content Source Management Page

    Launch Aspire (if it's not already running). See:

    Step 2. Add a new Content Source

    • For this step, follow the Configuration Tutorial of the connector of your choice; see the Connector list.


    Step 3. Add a new Parquet Extractor to the Workflow

    To add a Parquet Extractor application, drag the Parquet Extractor rule from the Workflow Library and drop it into the Workflow Tree where you want to add it. This automatically opens the Parquet Extractor window, where you configure the application.


    Step 3a. Specify Application Information

    In the Parquet Extractor window, specify the configuration information of the application.

    1. General configuration
      1. Do not store ids - Check if you do not want to store ids in NoSql. Warning! Aspire will not be able to delete items in the index in future crawls.
      2. Sub Job timeout - Timeout for secondary jobs, in milliseconds.
        1. Example: 60000
      3. No info messages - Check if you want info messages disabled.
      4. Batch Size - Sub job batch size.
      5. Batch Timeout - Sub job batch timeout, in milliseconds.
        1. Example: 60000
      6. Debug - Enable debug messages.
    2. Routing
      1. Workflow for add/update jobs - Workflow to which the generated add or update jobs are sent.
      2. Workflow for delete jobs - Workflow to which the generated delete jobs are sent.
      3. Workflow for error jobs - Workflow to which the generated error jobs are sent.
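As a rough way to reason about the settings listed above before entering them in the window, the following Python sketch models them as a plain data structure. It is not Aspire's API: the class name, field names, workflow names, the batch size value, and the rule that timeouts and batch size must be positive are all assumptions made for illustration. Only the 60000 ms example values come from this page.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ParquetExtractorSettings:
    """Hypothetical mirror of the fields in the Parquet Extractor window."""

    # General configuration
    do_not_store_ids: bool = False     # if True, future crawls cannot delete index items
    sub_job_timeout_ms: int = 60000    # example value from the page
    no_info_messages: bool = False
    batch_size: int = 100              # illustrative value, not a documented default
    batch_timeout_ms: int = 60000      # example value from the page
    debug: bool = False

    # Routing (workflow names are placeholders)
    add_update_workflow: str = "addUpdate"
    delete_workflow: str = "delete"
    error_workflow: str = "error"

    def problems(self) -> List[str]:
        """Return human-readable issues; assumes all numeric fields must be positive."""
        found = []
        if self.sub_job_timeout_ms <= 0:
            found.append("Sub Job timeout must be a positive number of milliseconds")
        if self.batch_size <= 0:
            found.append("Batch Size must be positive")
        if self.batch_timeout_ms <= 0:
            found.append("Batch Timeout must be a positive number of milliseconds")
        return found

    def workflow_for(self, action: str) -> str:
        """Pick the workflow a generated job would be routed to, by job action."""
        routes = {
            "add": self.add_update_workflow,
            "update": self.add_update_workflow,
            "delete": self.delete_workflow,
            "error": self.error_workflow,
        }
        if action not in routes:
            raise ValueError(f"unknown job action: {action!r}")
        return routes[action]
```

For instance, `ParquetExtractorSettings().workflow_for("update")` selects the add/update workflow, which reflects that add and update jobs share a single routing entry while delete and error jobs each have their own.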


    Step 3b. Share rule to a library

    Once the application is configured, it must be shared to a library (new or existing).

    Note: This is a required step. 

    Step 3c. Copy the rule from the shared library

    Add the shared application from the library to the Delete workflow.

    Note: This is a required step. 

    Info

    For the application to work, disable the ExtractText stage performed by the connector in the connector's Advanced Properties.

    Once you've clicked on the Add button, it will take a moment for Aspire to download all of the necessary components (the Jar files) from the Maven repository and load them into Aspire. Once that's done, the application will appear in the Workflow Tree.

    Info

    For details on using the Workflow section, please refer to Workflow introduction.