RSS Connector Introduction (Aspire 2)

Features

The RSS connector will crawl content from RSS feeds. If you don't know what RSS is, see the Wikipedia entry here.

The RSS Connector component was written during an Extreme Programming exercise at the company kick-off. We made it available as a proof-of-concept connector. If you find bugs, or if it doesn't do what you want, let us know and we'll try to fix it.

Some of the features of the RSS connector include:

Retrieves content from multiple RSS feeds
Performs incremental crawling, using a timestamp, so that only new or updated documents are indexed
DOES NOT delete content from the index
Is search engine independent
Runs from any machine with access to the given RSS server
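
As an illustration of the timestamp-based incremental crawling listed above, here is a minimal sketch in Python. It is not the connector's actual implementation: the sample feed, the function name new_items, and the choice to treat items without a pubDate as new are all assumptions made for this example.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

# Hypothetical sample feed; a real crawl would fetch the XML over HTTP.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>Old post</title>
      <link>http://example.com/old</link>
      <pubDate>Mon, 01 Jan 2018 10:00:00 GMT</pubDate>
    </item>
    <item>
      <title>New post</title>
      <link>http://example.com/new</link>
      <pubDate>Wed, 03 Jan 2018 10:00:00 GMT</pubDate>
    </item>
  </channel>
</rss>"""


def new_items(feed_xml, last_crawl):
    """Return the feed items published after last_crawl (an aware datetime)."""
    root = ET.fromstring(feed_xml)
    fresh = []
    for item in root.iter("item"):
        entry = {
            "title": item.findtext("title"),
            "link": item.findtext("link"),
        }
        pub = item.findtext("pubDate")
        if pub is None:
            # Assumption: with no timestamp we cannot tell, so treat as new.
            fresh.append(entry)
            continue
        published = parsedate_to_datetime(pub)
        if published.tzinfo is None:
            # Assumption: dates without a usable zone are taken as UTC.
            published = published.replace(tzinfo=timezone.utc)
        if published > last_crawl:
            fresh.append(entry)
    return fresh


# Timestamp saved at the end of the previous (hypothetical) crawl.
last_crawl = datetime(2018, 1, 2, tzinfo=timezone.utc)
result = new_items(SAMPLE_FEED, last_crawl)
for entry in result:
    print(entry["title"], entry["link"])
```

Note that a filter of this kind only ever adds or updates documents. It never notices items that have dropped out of the feed, which is consistent with the connector not deleting content from the index.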

NOTE: The RSS connector does not support document-level security. All documents and files crawled by the RSS connector will be submitted without ACLs.
