Log in
Skip to sidebar
Skip to main content
Confluence
Spaces
Hit enter to search
Help
Online Help
Keyboard Shortcuts
Feed Builder
What’s new
Available Gadgets
About Confluence
Log in
Aspire 3.1 (Ash)
Pages
Search
Page tree
Browse pages
Configure
Space tools
A
t
tachments (0)
Page History
Page Information
Resolved comments
View in Hierarchy
View Source
Export to PDF
Export to Word
Pages
Home Aspire 3.1
Jira links
Hadoop Components
Created by
Unknown User (nnavarro)
, last modified by
user-1b188
on
Aug 02, 2017
Aspire provides a list of components to interact with or work inside of Hadoop.
On this page:
Publish to HDFS Aspire Components
Post HDFS
Use this stage to write
AspireObjects
of each processed Aspire job to HDFS
Post WebHDFS
Use this stage to write
AspireObjects
of each processed Aspire job to the WebHDFS REST API
Aspire for Hadoop Components
Emit
Use this stage to write a key and the AspireObject of the incoming job to the Hadoop context
Reducer Subjob Extractor
From a Reduce job, for every AspireObjectWritable entry associated with the key, creates a subjob with the embedded AspireObject
Hadoop Map Reduce Job
Aspire Hadoop Map Reduce Job
A generic Hadoop Job Driver that can be configured to execute
map
,
reduce
or/and
combine
tasks using Aspire pipelines.
HDFS Utilities
Load HDFS
Loads an AspireObject json from HDFS using a key
Copy To HDFS
Copies a local file/folder into HDFS
Delete From HDFS
Deletes an HDFS file/folder
Hadoop Solutions
Semantic Co-occurrence Solution
THIS ITEM IS BEING DEPRECATED.
The co-occurrence or collocation of words to form short phrases (2-4 words) can be useful in tagging content and performing query enhancement by adding a level of meaning to these phrases and therefore improved relevancy for result sets.
No labels
Overview
Content Tools
{"serverDuration": 103, "requestCorrelationId": "f346f7612daf8f23"}