Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Status
colourYellow
titleIn Progress

Panel
titleOn this page

Table of Contents



Image ModifiedStep 1. Launch Aspire and

open

Open the Content Source Management

Page

page

Launch Aspire (if it's not already running). See:

Step 2. Add a new

Content Source

content source

For this step

please

, follow the step from the Configuration Tutorial of the connector of you choice, please refer to Connector list


Step 3. Add a new Publish to HDFS to the

Workflow

workflow

To add a Publish to HDFS drag HDFS, drag from the Publish to HDFS rule from the Workflow Library and drop to the Workflow Tree where you want to add it. This will automatically open the Publish to HDFS window for the configuration of the publisher.

Step 3a. Specify

Publisher Information

publisher information

 In the Publish to HDFS window, specify the connection information to publish to the HTTP.

    1. Enter the name of the publisher. (This name must be unique).
    2. Enter the description of the publisher that will be shown in the Workflow Tree.


      Note

      Not all HDFS clusters have WebHDFS enabled.

      3.a. Publish using Web HDFS

      In the HDFS section of the Publish to Web HDFS window specify the connection information to publish to HDFS.

      1. Enter the HDFS URL. Use hdfs:// protocol and the port (by default  between 50070-50075). I.e. http://name-node-server:50070/webhdfs/v1/
      2. Specify the absolute HDFS Folder Path where the files will be published to. I.e. /user/jsmith/my_aspire_output. (The user which runs Aspire must have write access to the HDFS folder).
      3. Specify the Delegation Token it will contain the credentials to publish into HDFS
      4. Debug: Check if you want to run the publisher in debug mode.
      5. Click on the Add button.

      Image Added

  1. ExampleExample

Once you've clicked on the clicked Add button, it will take a moment for Aspire to download all of the necessary components (the Jar files) from the Maven repository and load them into Aspire. Once that's done, the publisher will appear in the Workflow Tree.

Info

For details on using the Workflow section, please refer to Workflow introduction.