The Apache Kafka Connector will crawl content from a repository.

Easy Heading Free

navigationTitle	On this Page
wrapNavigationText	true
navigationExpandOption	expand-all-by-default

Introduction

The Apache Kafka Connector will crawl content from a filesystem with PLAINTEXT protocol.

Environment and Access Requirements

Repository Support

The Apache Kafka connector supports crawling the following the repositories:

Repository	Version	Connector Version
Kafka	5.0.2	5.0.2

Account Privileges

For the Apache Kafka connector to be able to crawl content, the Aspire Worker nodes must be run with an account with .

Environment Requirements

Requirement	version
Apache Kafka	3.0.0

Framework and Connector Features

Framework Features

Name	Supported
Content Crawling	yes
Identity Crawling	no
Snapshot-based Incrementals	yes
Non-snapshot-based Incrementals	yes
Document Hierarchy	no

Connector Features

The Apache Kafka connector has the following features:

Extract documents from multiple Kafka topics.
Select between one and multiple servers to crawl.

Content Crawled

The Apache Kafka connector is able to crawl the following objects:

Name	Type	Relevant Metadata	Content Fetch and Extraction	Description
File	document	partition offset topic value	N/A	The files contained by the directories in the crawled file system.

Limitations

The Apache Kafka Connector has the following limitations:

REST API imlpementation is part of our future development plan.
No authentication mechanisms have been implemented to date.

Page tree

Versions Compared

Old Version 4

New Version 5

Key

Introduction

Environment and Access Requirements

Repository Support

Account Privileges

Environment Requirements

Framework and Connector Features

Framework Features

Connector Features

Content Crawled

Limitations

Page tree

Page History

Versions Compared

Old Version 4

New Version 5

Key

Introduction

Environment and Access Requirements

Repository Support

Account Privileges

Environment Requirements

Framework and Connector Features

Framework Features

Connector Features

Content Crawled

Limitations