Standalone Mode

Aspire 3.0 Connectors works by implementing an interface called RepositoryAccessProvider. It specifies the minimum required methods to access, fetch, and scan a given Repository. The Aspire 3.0 Connector Framework is a layer that provides common control code for Full/Incremental Crawling, Distributed Processing, Group Expansion, Schedules and the link between the Aspire Admin User Interface and the crawls, all of this by calling the RepositoryAccessProvider methods when it requires to access the Repository.

This separation of the Connector Framework and the Connectors Implementations allows a very natural usage of the Connectors Implementations outside Connector Framework and even outside Aspire at all.

The RepositoryAccessProvider is responsible for three main different tasks

Initializing the crawl configuration or SourceInfo, from the user configuration properties
1. Initial URL, username, passwords, etc.
2. Method: newSourceInfo(AspireObject properties)
Extract the initial or root crawl items
1. Method: processCrawlRoot(SourceItem root, SourceInfo info, ScanListener listener)

For Legacy connectors standalone crawls see:

Connector Scanner Stage Test Harness

Page tree

Standalone Mode