Heritrix Prerequisites (Aspire 2)

Created by Johnny Vargas on Jun 28, 2018

The Aspire Heritrix Connector uses a custom Heritrix 3.1.1 crawl engine.

The Aspire Server running the crawl must have access to the seed(s) URL (configured in the content source configuration).
Check for any credentials needed to access the sites to be crawled. (Basic, Digest, HTTP forms and NTLM are supported)

No labels