Step 1. Launch Aspire and Open the Content Source Management Page
Step 2. Select or Add a Content Source
Select a "Content Source" to work with.
Step 2a. Disable Text Extraction
Step 2b. Configure Workflow Information
Step 3: Initiate a Crawl
Now that the HDFS writer is set up, a crawl can be initiated.
- When crawling the content will be written to HDFS