This section describes the Configuration for Connection used by the Amazon S3 Seeds.

Step 1. Open the Aspire Admin UI.

Browse to the Aspire Admin UI. It is typically located at http://localhost:50505.



Step 2. Select the Connection option from the left-hand menu.

The "Connection" option, identified by a "connection" image , is located on the left side of the application, between the "Credentials" and "Connector Instances" options. Click on it to navigate to the "Connection" page.



Step 3. Specify Connection Description and Type

Once on the "Connection" page, click on the "+New" option to create a new Connection or select an existing one to modify it.

  • Description: specify a description for the Connection. It is advised for it to be concise and meaningful.
  • Type: select "S3" as the type for the Connection.



Step 4. Specify Connection General information

Once the type has been selected, you will be presented with the "General" section of the "Connection" page. Here, you need to enter the following information for the Connection:

  • AWS Region: the AWS region where the s3 storage is located.
  • Use global endpoint: enable to use the global endpoint (if unchecked, the connection might fail, depending on the bucket region).
  • Allowed storage classes: by default, all storage classes are retrieved. Uncheck storage classes that should not be retrieved.
  • Index folders: if enabled, folders will be indexed.
  • Scan recursively: if enabled, discovered items are scanned recursively.



Step 5. Specify Scope (Optional)

The "Scope" section is located between the "General" section and the "Credentials" section of the "Connection" page. Here, you can specify document inclusions and exclusions, based on the regular expression patterns:

  • Scan excluded items: if enabled, scans excluded container items so documents inside them can be processed.
  • Include patterns: list of regular expressions to match documents to be included in the crawl.
  • Exclude patterns: list of regular expressions to match documents to be excluded from the crawl.


Step 5. Specify Credentials

The "Credentials" section is located between the "Scope" section and the "Policies" section of the "Connection" page. Here, you have to select a set of previously created Amazon S3 Credentials to be used from the Credentials' combo box.


Step 6. Specify Policies (Optional)

The "Policies" section is the last section, located right below the "Credentials" section of the "Connection" page:

  • Throttle Policy: here, you can select a previously created Throttling Policy from the Throttle Policy combo box.
  • Route Policy: here you can select a previously created Routing Policy from the Route Policy combo box.




Step 7. Save the Connection

Click on the "Complete" button to save the new Connection (when updating, the button option will read "Save" instead of "Complete").



  • No labels