Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The Apache Kafka Connector will crawl content from a  repository.


Easy Heading Free
navigationTitleOn this Page
wrapNavigationTexttrue
navigationExpandOptionexpand-all-by-default

Introduction


The Apache Kafka Connector will crawl content from a filesystem with PLAINTEXT protocol.

Environment and Access Requirements


Repository Support

The Apache Kafka connector supports crawling the following the repositories:

RepositoryVersionConnector Version
Kafka5.0.25.0.2

Account Privileges

For the Apache Kafka connector to be able to crawl content, the Aspire Worker nodes must be run with an account with .

Environment Requirements

Requirementversion
Apache Kafka3.0.0

Framework and Connector Features


Framework Features

Name Supported
Content Crawlingyes
Identity Crawlingno
Snapshot-based Incrementalsyes
Non-snapshot-based Incrementalsyes
Document Hierarchyno

Connector Features

The Apache Kafka connector has the following features:

  • Extract documents from multiple Kafka topics.
  • Select between one and multiple servers to crawl.


Content Crawled


The Apache Kafka connector is able to crawl the following objects:

NameTypeRelevant MetadataContent Fetch and ExtractionDescription
Filedocument
  • partition
  • offset
  • topic
  • value
N/AThe files contained by the directories in the crawled file system.

Limitations


The Apache Kafka Connector has the following limitations:

  • REST API imlpementation is part of our future development plan.
  • No authentication mechanisms have been implemented to date.