The SharePoint 2013 Scanner component performs full and incremental scans over a SharePoint 2013 repository. It stores the repository's last SharePoint change token so that the next incremental crawl retrieves only the changes made since then. Updated content is then submitted to the configured pipeline in AspireObjects attached to Jobs. In addition to the URL of the changed item, each AspireObject contains metadata extracted from the repository. Updated content is split into three types: add, update, and delete. Each type is published on a different event so that it may be handled by a different Aspire pipeline.
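The incremental behaviour described above can be illustrated with a minimal Python sketch. It is not the Aspire or SharePoint API: the change log, the integer tokens (real SharePoint change tokens are opaque strings), the event names in `EVENT_FOR`, and the function `incremental_scan` are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical change log of (token, change_type, url) entries.
# Integer tokens are a simplification; real change tokens are opaque.
CHANGE_LOG = [
    (1, "add", "http://sp/sites/a/doc1"),
    (2, "update", "http://sp/sites/a/doc1"),
    (3, "delete", "http://sp/sites/a/doc2"),
]

# Illustrative event names, one per content type.
EVENT_FOR = {"add": "onAdd", "update": "onUpdate", "delete": "onDelete"}


def incremental_scan(change_log, last_token):
    """Group changes newer than last_token by event and return the new token."""
    published = defaultdict(list)
    new_token = last_token
    for token, change_type, url in change_log:
        if token <= last_token:
            continue  # already handled by an earlier crawl
        published[EVENT_FOR[change_type]].append(
            {"url": url, "action": change_type}
        )
        new_token = max(new_token, token)
    return dict(published), new_token


# A crawl that last stopped at token 1 only sees the update and the delete.
published, token = incremental_scan(CHANGE_LOG, last_token=1)
```

Storing the returned token and passing it to the next call is what keeps a later incremental crawl from republishing the same changes.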

The scanner reacts to an incoming job. This job may instruct the scanner to start, stop, pause, resume, or cacheGroups. Typically, the start job contains all the information the scanner requires to perform the crawl. However, the scanner can also be configured with default values via the application.xml file. When pausing or stopping, the scanner waits until all the jobs it published have completed before completing itself.
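The job handling described above can be modelled roughly as follows. This is a hypothetical sketch, not the Aspire implementation: the class `ScannerSketch`, its state names, and the property keys used in the usage lines are assumptions. It only shows two ideas from the text: start-job values overriding configured defaults, and pause/stop being deferred until published jobs complete.

```python
class ScannerSketch:
    """Hypothetical model of the scanner's job handling; not the Aspire API."""

    ACTIONS = {"start", "stop", "pause", "resume", "cacheGroups"}

    def __init__(self, defaults=None):
        self.defaults = dict(defaults or {})  # stands in for application.xml values
        self.config = {}
        self.state = "idle"
        self.pending = None   # state to enter once outstanding jobs finish
        self.outstanding = 0  # published jobs that have not completed yet

    def handle(self, action, properties=None):
        if action not in self.ACTIONS:
            raise ValueError(f"unknown action: {action}")
        if action == "start":
            # Values carried by the start job override the configured defaults.
            self.config = {**self.defaults, **(properties or {})}
            self.state, self.pending = "running", None
        elif action == "resume":
            self.state, self.pending = "running", None
        elif action in ("pause", "stop"):
            self.pending = "paused" if action == "pause" else "stopped"
            self._settle()
        # "cacheGroups" leaves the crawl state untouched in this sketch.
        return self.state

    def publish(self):
        self.outstanding += 1

    def job_completed(self):
        self.outstanding -= 1
        self._settle()

    def _settle(self):
        if self.pending is None:
            return
        if self.outstanding == 0:
            self.state, self.pending = self.pending, None
        else:
            self.state = "waiting"  # jobs still in flight


# Usage: the property keys below are made up for illustration.
scanner = ScannerSketch(defaults={"batchSize": 50})
scanner.handle("start", {"fullCrawl": True})
scanner.publish()
scanner.handle("stop")    # stays "waiting" while one job is outstanding
scanner.job_completed()   # now the scanner reaches "stopped"
```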

SharePoint 2013 Scanner
Factory Name: com.searchtechnologies.aspire:aspire-sharepoint2013-connector
subType: default
Inputs: AspireObject from a content source submitter holding all the information required for a crawl
Outputs: Jobs from the crawl

Configuration


This section lists all configuration parameters available to configure the SharePoint 2013 Scanner component.

General Scanner Component Configuration

Basic Configuration

Element | Type | Default | Description
    

 

Branch Handler Configuration

Element | Type | Default | Description
    

 

SharePoint Scanner Configuration

Element | Type | Default | Description
    



Example Configuration


Simple

 

 

Complex

 


 
