The Amazon S3 connector crawls content from any Amazon Simple Storage Service (S3) repository.

Features


Some of the features of the Amazon S3 connector include:

  • Performs incremental crawling (so that only new/updated documents are indexed)
  • Fetches Object ACLs (access control lists) for S3 document-level security
  • Is search engine independent
  • Runs from any machine with access to the given S3 repository
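As an illustration of the incremental crawling feature, here is a minimal sketch in Python. It is not the connector's actual implementation: the function name `select_changed`, the `seen` state dictionary, and the choice of the ETag as the change marker are all assumptions made for this example. The `Key` and `ETag` fields mirror the entries that an S3 object listing returns.

```python
from typing import Dict, Iterable, List


def select_changed(listing: Iterable[dict], seen: Dict[str, str]) -> List[str]:
    """Return the keys that are new or updated since the previous crawl.

    `listing` is an iterable of dicts with "Key" and "ETag" entries
    (the shape of items in an S3 object listing). `seen` maps each key
    to the ETag recorded on the previous crawl; it is updated in place.
    Both names are hypothetical and exist only for this sketch.
    """
    to_index: List[str] = []
    for obj in listing:
        key, etag = obj["Key"], obj["ETag"]
        # A key that is missing from `seen`, or whose ETag differs,
        # is treated as new or updated and must be (re)indexed.
        if seen.get(key) != etag:
            to_index.append(key)
            seen[key] = etag
    return to_index


if __name__ == "__main__":
    state: Dict[str, str] = {}
    first = [{"Key": "a.txt", "ETag": "1"}, {"Key": "docs/b.txt", "ETag": "2"}]
    print(select_changed(first, state))   # first crawl: every key is new
    print(select_changed(first, state))   # nothing changed: empty list
```

Persisting `seen` between runs (for example, in a file or database) is what would let a later crawl skip unchanged documents.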


Content Retrieved


The Amazon S3 connector retrieves documents from every level of an S3 repository. The locations it covers are listed below.

Include

  • Documents stored in buckets
  • Documents stored in folders
  • Documents stored in subfolders


Future Development Plan 

Currently, when document metadata is extracted during indexing, the Object ACLs are added to each document. These ACLs specify the users (registered on Amazon) who have permission to read the document.
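The following Python sketch shows one way read-permitted users could be derived from an Object ACL. It is an illustration under stated assumptions, not the connector's code: the function name `readers_from_acl` and the rule that `READ` and `FULL_CONTROL` both imply read access are choices made for this example. The `Grants`, `Grantee`, `Type`, `ID`, and `Permission` fields follow the shape of an S3 object ACL response. Group grantees are skipped, in line with the current user-only behavior.

```python
from typing import List

# Assumption for this sketch: these two ACL permissions allow reading an object.
READ_PERMISSIONS = {"READ", "FULL_CONTROL"}


def readers_from_acl(acl: dict) -> List[str]:
    """Return the IDs of individual users who may read the object.

    `acl` is a dict shaped like an S3 object ACL response, with a
    "Grants" list of {"Grantee": {...}, "Permission": "..."} entries.
    The function name is hypothetical.
    """
    readers: List[str] = []
    for grant in acl.get("Grants", []):
        if grant.get("Permission") not in READ_PERMISSIONS:
            continue
        grantee = grant.get("Grantee", {})
        # Only individual users are considered; group grantees are skipped.
        if grantee.get("Type") != "CanonicalUser":
            continue
        user_id = grantee.get("ID")
        if user_id and user_id not in readers:
            readers.append(user_id)
    return readers


if __name__ == "__main__":
    sample_acl = {
        "Grants": [
            {"Grantee": {"Type": "CanonicalUser", "ID": "owner-id"},
             "Permission": "FULL_CONTROL"},
            {"Grantee": {"Type": "CanonicalUser", "ID": "reader-id"},
             "Permission": "READ"},
        ]
    }
    print(readers_from_acl(sample_acl))
```

The resulting list could then be stored as a metadata field on the indexed document so that the search engine can filter results by user.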

Additional security features, including handling of policy ACLs and group permissions, are planned for a future release.

Anything we should add? Please let us know.