The File System Connector can be configured using the Aspire Admin UI. It requires the following entities to be created:
Connection
Connector
Seed
Below are examples of how to create the Connection and the Seed. For the Connector, please refer to this page.
Create Connection
On the Aspire Admin UI, go to the connections tab
All existing connection will be list. Click on the new button
Enter the new connection description.
Select Filesystem from the Type list.
General
Base Path: Path of the base directory to crawl. All the seeds will be prefixed with this value to form the full path. All the seeds will be prefixed with this value to form the full path
Ignore Symbolic Links: If enabled, symbolic links will not be processed and links in the root items will cause an error.
Index directories: Enable to index the directories.
Scan recursively: Enable to scan discovered directories recursively.
Scope
Scan excluded items: Enable to force the scan of excluded directories, so child items within the scope can be found.
Include patterns: Patterns to match against document URL, if any of them match, the document will be included in the crawl.
Exclude patterns: Patterns to match against document URL, if any of them match, the document will be excluded from the crawl.
Security
Static ACLs: These ACLs will be added to all the documents.
Name: Name of the ACL.
Domain: Domain to which the ACL belongs to.
Entity: Whether this ACL is for a group or a user.
Access: Whether this ACL will have access to crawled files.
Policies
Throttle Policy: Select the throttle policy that applies to this connection object.
Routing Policies: Select the routing policies that this connection will use.
Create Connector
For the creation of the Connector object using the Admin UI, please refer to thispage
Create Seed
On the Aspire Admin UI, go to the seeds tab
All existing seed will be list. Click on the new button
Enter the new seed description.
Select Filesystem from the Type list.
Seed: The subdirectory to crawl. This value will be appended to the base path of the connection.
Connector: The ID of the connector to be used with this seed. The connector type must match the seed type.
Connection: The ID of the connection to be used with this seed. The connection type must match the seed type.
Workflow: The IDs of the workflows that will be executed for the documents crawled.
Tag: The tags of the seed, these can be used to filter the seed.
Policies
Throttle Policy: Select the throttle policy that applies to this connection object.
Routing Policies: Select the routing policies that this connection will use.