Features

The RSS connector will crawl content from RSS feeds. If you don't know what RSS is, see the Wikipedia entry here.

The RSS Connector component was written during an Extreme Programming exercise at the company kick-off. We made it available as proof of concept connector. If you find bugs, or if doesn't do what you want, let us know and we'll try to fix it.

Some of the features of the RSS connector include:

  • Retrieves content from multiple RSS feeds
  • Performs incremental crawling so that only new/updated documents are indexed, using a timestamp.
  • DOES NOT delete content from the index
  • Is search engine independent
  • Runs from any machine with access to the given RSS server

NOTE: The RSS connector does not support document level security. All documents and files crawled by the RSS connector will be submitted without ACLs

  • No labels