Before Beginning, Create User account and Set Access Right sections should be in prerequisites
For details on using the Aspire Content Source Management page, see Admin UI.
Select a "Content Source" to work with.
In the "Connector" tab:
If you need text extraction, you will need to add a text extraction stage in to the work flow later
In the Workflow tab:
install the binary writer either by dragging it from the Applications section of the workflow configuration or by adding a custom application with the group id “com.searchtechnologies.aspire” and the artifact id “app-hdfs-binary writer”
Configure the parameters
Now that the HDFS writer is set up, a crawl can be initiated.