In the "Service" tab, configure the parameters:
- HDFS Base
- The base directory on the hdfs file system under which all content will be written
- Form
- hdfs://<server>:<port>/<path>
- Example
- Content source directory
- Add one or more content source directories that will be scanned for compaction. These directories will be located below the base given above
- Form
- Example
- Period
- The period between scans of each content source directory
- Form
- Example
- Threshold
- The number of files that must exist in the directory before compaction takes place
- Suppress deletes
- Checking this box will prevent binary files that are in an existing HAR file from being deleted when a delete action is encountered.
- HDFS Options
- Security
- Choose the type of security to use to access the HDFS file system- Kerberos or none
- User principle
- The principal user for Kerberos
- Key tab file
- The user's key tab file
- Form
- Example
- Add resources
- Check this box if you need to add Hadoop resources to the configuration (such as site-core.xml)
- Resource file
- The path to a resource file to add to the configuration
- Form
- Example
- Block size
- The size of block to be used when accessing the HDFS file system
- Form
- Example
- Buffer size
- The size of buffer to be used when accessing the HDFS file system
- Form
- Example
- Replication
- The HDFS replication factor
- Form
- Example
- Configure lock
- Check this box if you want to configure the lock files
- Tries
- The number of attempts to get a lock file before moving on
- Form
- Example
- Retry sleep
- The period to wait before attempting to get the lock when a previous attempt has failed
- Form
- Example
- Expiry
- The period after which a lock is deemed to have expired and will be released when another process attempts to get it.
- Form
- Example
- Debug
- Check this box to enable debug messages
Save the service and it will start automatically