You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Next »

The crawl state is controlled by the manager to which the crawl is assigned. This is done to prevent any issues that could show up due to synchronization between nodes. The crawl state is split into two phases:

  • Crawl Phase
  • Crawl State


Crawl Phase


The crawl phase indicates the action the crawl is currently performing. Each crawl will will transition to the following phase if a set of conditions is met. Not all crawl types, transition through all phases. These are the phases for each crawl type:

  • Full Crawl:
    1. Idle
    2. Crawl Start
    3. Crawl
    4. Post-Reprocess (if enabled)
    5. Crawl End
  • Incremental Crawl:
    1. Idle
    2. Crawl Start
    3. Pre-Reprocess (if enabled)
    4. Crawl
    5. Deletes
    6. Post-Reprocess (if enabled)
    7. Crawl End
  • Identity Crawl:
    1. Idle
    2. Crawl Start
    3. Crawl
    4. Crawl End










Crawl State


Each crawl phase has a set of possible states. The allowable states for a given phase will be a subset of all the possible states, with some states not being allowed in a given phase (as they do not make sense – “pausing” in the idle phase for instance). The diagram shows the crawl phases and the allowable states.


  • No labels