The Aspider Web Crawler Apache Kafka Connector can be configured using the Aspire Admin UI. It requires the following entities to be created:

Easy Heading Free

Create Connection

On the Aspire Admin UI, go to the connections page
All existing connections will be listed. Click on the new button
Enter the new connection description.
Select Apache Kafka from the Type list.
General:
1. Servers: The Kafka server(s) hostnames fololowed followed by the port. Ex. localhost:9092
2. Stop on scan error: Check if you want the connector to stop crawing crawling when an error is encountered.
3. Enable Stop Consumer: If checked, the crawl will stop at a certain timeout.
4. Schema Registry Configuration: If checked, the schema will be retrieved from Schema Registry. Schema Registry credentials are required.
  1. Schema Registry URL: the schema registry URL to connect to.
  2. Username: Schema Registry User Name.
  3. Password: Schema Registry Password.
Starting offset for ful full crawl:
1. Extract text: the auto offset reset configuration. Starting offsets: Starting offset for a full crawl.
2. Debug: Debug messages on/off.

Page tree