The Amazon S3 connector crawls content stored in Amazon Simple Storage Service (Amazon S3).

Some of the features of the Amazon S3 connector include:

  • Performs incremental crawling (so that only new/updated documents are indexed)
  • Fetches Object ACLs (access control lists) for S3 document-level security
  • Is search engine independent
  • Runs from any machine with access to the given S3 repository
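Incremental crawling can be pictured as comparing a snapshot saved by the previous crawl with the current object listing. The sketch below is only an illustration, not the connector's actual implementation. It assumes each object is identified by its key and fingerprinted by its ETag; the `diff_snapshots` function and the snapshot layout are hypothetical.

```python
from typing import Dict, List, NamedTuple


class CrawlDelta(NamedTuple):
    """Keys that an incremental crawl would need to index."""
    added: List[str]
    updated: List[str]


def diff_snapshots(previous: Dict[str, str], current: Dict[str, str]) -> CrawlDelta:
    """Compare two {object key: ETag} snapshots (a hypothetical layout).

    A key is "added" if it is absent from the previous snapshot and
    "updated" if it is present in both but its ETag has changed.
    """
    added = sorted(k for k in current if k not in previous)
    updated = sorted(
        k for k in current if k in previous and current[k] != previous[k]
    )
    return CrawlDelta(added, updated)


# Illustrative data: q1 is unchanged, q2 changed, q3 is new.
previous = {"reports/q1.pdf": "etag-a", "reports/q2.pdf": "etag-b"}
current = {
    "reports/q1.pdf": "etag-a",
    "reports/q2.pdf": "etag-c",
    "reports/q3.pdf": "etag-d",
}
delta = diff_snapshots(previous, current)
```

With the sample data, only `reports/q2.pdf` and `reports/q3.pdf` would be sent for indexing; the unchanged `reports/q1.pdf` is skipped.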

Content Retrieved

The Amazon S3 connector retrieves documents from the following locations:


  • Documents stored in buckets
  • Documents stored in folders
  • Documents stored in subfolders
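Amazon S3 keeps objects in a flat namespace inside each bucket; folders and subfolders are, by convention, key prefixes separated by "/". The sketch below shows how an object key maps to its chain of enclosing folders. The `folder_chain` helper is hypothetical and is not part of the connector.

```python
from typing import List


def folder_chain(key: str, delimiter: str = "/") -> List[str]:
    """Return every folder prefix that contains the object, outermost first.

    An object stored directly in the bucket has no enclosing folder,
    so the result is an empty list.
    """
    parts = key.split(delimiter)[:-1]  # drop the final segment (the file name)
    return [delimiter.join(parts[: i + 1]) + delimiter for i in range(len(parts))]


chain_root = folder_chain("readme.txt")              # stored directly in the bucket
chain_nested = folder_chain("docs/2023/report.pdf")  # stored in a subfolder
```

Here `docs/` plays the role of a folder and `docs/2023/` the role of a subfolder, although both are only prefixes of the object key.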

Future Development Plan 

Currently, when document metadata is extracted during indexing, the Object ACLs are added to each document. These ACLs specify the users (registered on Amazon) who have permission to read the document.
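As an illustration of what reading permission from an Object ACL means, the sketch below collects the users who may read a document. It assumes the ACL is a dictionary shaped like the S3 GetObjectAcl response (for example, as returned by boto3's `get_object_acl`), and it treats both `READ` and `FULL_CONTROL` grants as read access. The `readers_from_acl` function and the sample IDs are hypothetical; this is not the connector's own code.

```python
from typing import Any, Dict, List

# ACL permissions that allow reading the object's data.
READ_PERMISSIONS = {"READ", "FULL_CONTROL"}


def readers_from_acl(acl: Dict[str, Any]) -> List[str]:
    """Return the canonical user IDs that an object ACL allows to read."""
    readers: List[str] = []
    for grant in acl.get("Grants", []):
        grantee = grant.get("Grantee", {})
        if grantee.get("Type") != "CanonicalUser":
            continue  # group grants are skipped; only individual users are collected
        user_id = grantee.get("ID")
        if user_id and grant.get("Permission") in READ_PERMISSIONS:
            if user_id not in readers:  # avoid duplicates across multiple grants
                readers.append(user_id)
    return readers


# Sample ACL with made-up IDs, shaped like a GetObjectAcl response.
sample_acl = {
    "Owner": {"ID": "owner-id"},
    "Grants": [
        {"Grantee": {"Type": "CanonicalUser", "ID": "owner-id"}, "Permission": "FULL_CONTROL"},
        {"Grantee": {"Type": "CanonicalUser", "ID": "reader-id"}, "Permission": "READ"},
        {"Grantee": {"Type": "CanonicalUser", "ID": "acl-editor-id"}, "Permission": "WRITE_ACP"},
        {
            "Grantee": {"Type": "Group", "URI": "http://acs.amazonaws.com/groups/global/AllUsers"},
            "Permission": "READ",
        },
    ],
}
readers = readers_from_acl(sample_acl)
```

In this sample, the user with only `WRITE_ACP` and the group grant are both left out of the result, which mirrors the user-level scope described above.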

Additional security features, including handling of policy ACLs and group permissions, are planned for a future release.

