Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

 

 

The Amazon S3 connector will crawl content from any Amazon Simple Storage Service. 

Features


Some of the features of the Amazon S3 connector include:

  • Performs incremental crawling (so that only new/updated documents are indexed)
  • Fetches Object ACLs (access control lists) for S3 document-level security
  • Is search engine independent
  • Runs from any machine with access to the given S3 repository

Content Retrieved


The Amazon S3 connector retries several types of documents, listed bellow are the inclusions and exclusions of these documents.

Include

  • Documents stored in buckets
  • Documents stored in folders
  • Documents stored in subfolders

Future Development Plan 

Currently, when document metadata is extracted during indexing, the Object ACLs are being added to each document. These ACLs specify the users (registered on Amazon) who have permission to read the document.

Additional security features, including handling policy ACLs and group permissions, are planned and will be available in the future.

Anything we should add? Please let us know.