Before Beginning: Create User Account
A prerequisite for crawling S3 with Aspire is to have an S3 account with the correct permissions. The recommended name for this account is "aspire_crawl_account" or something similar.
The username and password for this account will be required below. (In AWS terms, this will be the Access Key ID and the Secret Key that you get from Amazon.) This user account will need sufficient permissions for crawling all the content you wish to index.
Step 1: Set S3 Access Rights
Amazon S3 manages user rights and permissions via its AWS Identity and Access Management (IAM) system.
To set up your "aspire_crawl_account", do the following:
- Log into the Amazon AWS Console as an Administrator or owner of the AWS account.
- Set up an "aspire_crawl_account" user.
- Make sure that this user has permissions to all the buckets and resources it will need to crawl the content you wish to crawl (at least Admin or root level permissions to the buckets you want to crawl).
- Copy down the Access Key ID and Secret Key for this user, as you will need it later in these instructions.
Overview
Content Tools