The Aspire Heritrix Connector uses a custom Heritrix 3.1.1 crawl engine.

  • The Aspire Server running the crawl must have access to the seed(s) URL (configured in the content source configuration).
  • Check for any credentials needed to access the sites to be crawled. (Basic, Digest, HTTP forms and NTLM are supported)
  • No labels