Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The Elasticsearch Connector will crawl content from a Elasticsearch repository.


Easy Heading Free
navigationTitleOn this Page
wrapNavigationTexttrue
navigationExpandOptionexpand-all-by-default

Introduction


The Elasticsearch connector retrieves documents stored in an Elastic index using an Elasticsearch query to filter the documents to extract.

Environment and Access Requirements


Repository Support

The Elasticsearch connector supports crawling the following the repositories:

RepositoryVersionConnector Version
ElasticsearchALL5.0

Account Privileges

For the Elasticsearch connector to be able to crawl content, the Aspire Worker nodes must be run with an account with (permissions).

If authentication is enabled in the Elasticsearch server, a user account with sufficient privileges must be supplied.

Environment Requirements


Framework and Connector Features


Framework Features

Name Supported
Content CrawlingYES
Identity CrawlingNO
Snapshot-based IncrementalsYES
Non-snapshot-based IncrementalsNO
Document HierarchyNO

Connector Features

The Elasticsearch connector has the following features:

  • Extract documents from multiple Elasticsearch indexes
  • Use of  Query DSL to define queries.
  • Slice support for querying
  • Basic and AWS Signature V4 Authentication.
  • Use of Get or MGet Elasticsearch methods for fetching content.<Connector Feature 1>


Content Crawled


The Elasticsearch connector is able to crawl the following objects:

NameTypeRelevant MetadataContent Fetch and ExtractionDescription




Limitations


The Elasticsearch Connector has the following limitations:

  • <Limitation 1>