The Amazon S3 connector will crawl content from any Amazon Simple Storage Service.

On this page

Features

Some of the features of the Amazon S3 connector include:

Performs incremental crawling (so that only new/updated documents are indexed)
Fetches Object ACLs (access control lists) for S3 document-level security
Is search engine independent
Runs from any machine with access to the given S3 repository

Content Retrieved

The Amazon S3 connector retrieves several types of documents. Listed below are the inclusions and exclusions to these documents.

Include

Documents stored in buckets
Documents stored in folders
Documents stored in subfolders

Future Development Plan

Currently, when document metadata is extracted during indexing, the Object ACLs are being added to each document. These ACLs specify the users (registered on Amazon) who have permission to read the document.

Additional security features, including handling policy ACLs and group permissions, are planned and will be available in the future.

Anything we should add? Please let us know.

Page tree

Amazon S3 Introduction

Features

Content Retrieved

Include

Future Development Plan