This tutorial walks through the steps necessary to crawl SharePoint 2016 using the SharePoint 2016 connector.
Create a User Account
A prerequisite for crawling SharePoint 2016 is to have a Windows Active Directory account. The domain, username and password for this account will be required below.
The recommended name for this account is "aspire_crawl_account". See prerequisites section for more details.
Step 1: Set SharePoint Access Rights
"aspire_crawl_account" will need to have sufficient access rights to read all of the documents in SharePoint 2016 that you wish to process. See User Account Requirements for details on what rights will be required for the account in SharePoint.
To set the rights for your account at Web Application level, do the following:
- Open SharePoint Central Administration.
- Go to "Manage web applications" under "Application Management".
- Select the web application which has the site collections to crawl.
- Click on "User Policy".
- On the "Policy for Web Application" popup, click on "Add Users".
- On "Choose Users" space type aspire_crawl_account with the corresponding domain.
- Select "Full Read" permission on "Choose Permissions" section.
- Click on "Finish".
To set the rights for your "aspire_crawl_count" on site collections, do the following:
- Go to the desired site collection and log on with a site collection administrator (or any user authorized to edit site permissions).
- Click on the gear icon at the top left corner.
- Click on "Shared With...".
- Click on "ADVANCED"
- Click on Grant Permissions
- Enter the domain and account name on "Invite people to" field (i.e. aspire_crawl_account).
- Select Show Options.
- Select the Read permission.
- Click Share.
During the Crawl
- Click on the "Refresh" button on the Content Sources page to view the latest status of the crawl.
The status will show RUNNING while the crawl is going, and Completed when it is finished.
- Click on "Statistics" to view the number of documents crawled so far, the number of documents submitted, and the number of documents with errors.
If there are errors, you will get a clickable "Error" flag that will take you to a detailed error message page.
Group expansion configuration is done under "Group Expansion" in the Connector tab. Group expansion scheduling is done in the General tab.
- Click on the Enable group expansion checkbox to enable the configuration section.
- Select the Group Expansion service in the dropdown.
- Specify if an intermediate file should be used if there are memory concerns.
- Specify external group servers if any.
- Map usernames to LDAP users if needed.
- Enable group expansion workflow adds a custom groovy script.
- As an optional setting click on the "Use external Group Expansion" checkbox to select an LDAP Cache component for LDAP group expansion. See more info on the LDAP Cache component on LDAP Cache
- Specify transformation options to leave or remove the domain from the user name