Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Aspire provides a list of components to interact with or work inside of Hadoop.

Panel
titleOn this page

Table of Contents

Publish to HDFS Aspire Components

Feature only available with Aspire Enterprise  Post HDFS
  • Use this stage to write AspireObjects of each processed Aspire job to HDFS
Feature only available with Aspire Enterprise  Post WebHDFS
  • Use this stage to write AspireObjects of each processed Aspire job to the WebHDFS REST API

Aspire for Hadoop Components

Feature only available with Aspire Enterprise  Emit
  • Use this stage to write a key and the AspireObject of the incoming job to the Hadoop context
Feature only available with Aspire Enterprise  Reducer Subjob Extractor
  • From a Reduce job, for every AspireObjectWritable entry associated with the key, creates a subjob with the embedded AspireObject

Hadoop Map Reduce Job

Feature only available with Aspire Enterprise  Aspire Hadoop Map Reduce Job
  • A generic Hadoop Job Driver that can be configured to execute map, reduce or/and combine tasks using Aspire pipelines.

HDFS Utilities

Feature only available with Aspire Enterprise  Load HDFS
  • Loads an AspireObject json from HDFS using a key
Feature only available with Aspire Enterprise  Copy To HDFS
  • Copies a local file/folder into HDFS
Feature only available with Aspire Enterprise  Delete From HDFS
  • Deletes an HDFS file/folder

Hadoop Solutions

Feature only available with Aspire Enterprise  Semantic Co-occurrence Solution
  • The co-occurrence or collocation of words to form short phrases (2-4 words) can be useful in tagging content and performing query enhancement by adding a level of meaning to these phrases and therefore improved relevancy for result sets.