Developer Information

Detailed information on configuring the Heritrix Application Bundle, including a discussion of all the metadata the Application Bundle produces, with an example.

Detailed information on the Heritrix Scanner, the Java component that crawls a set of websites. Needed only by programmers who have to integrate the Heritrix Scanner in ways not supported by the standard framework.

  • Heritrix Group Expansion

The Heritrix Connector doesn't extract ACLs, so security filtering is not normally needed. However, if static ACLs from an LDAP server are configured (see the Heritrix Scanner configuration for static ACLs), the default Group Expansion Manager can apply security filters based on your groups.

Questions and answers for developers using or modifying the Heritrix Scanner or the Application Bundle.
