The SharePoint 2019 connectors will crawl content from any SharePoint 2019 site collection URL. The connector will retrieve Sites, Lists, Folders, List Items and Attachments, as well as other pages (in .aspx format).
The SharePoint 2019 connectors supports crawling the following the repositories:
Repository | Version | Connector Version |
---|---|---|
SharePoint | 2019 | 5.1.1 |
The connectors offer one authentication options to access the SharePoint REST API: user account.
To configure a user crawl account, use the following, see SharePoint 2019 - Crawl Account Access.
To use a user crawl account on multiple site collections, you'll have to follow the steps on each site collection the access is needed.
The connector uses SharePoint's REST API, so the Aspire Worker nodes must have access to connect to the SharePoint 2019 environment.
Name | Supported |
---|---|
Content Crawling | yes |
Identity Crawling | yes |
Snapshot-based Incrementals | yes |
Non-snapshot-based Incrementals | yes |
Document Hierarchy | yes |
The SharePoint 2019 connector has the following features:
The SharePoint Online connector can crawl the following objects:
Name | Type | Relevant Metadata | Content Fetch & Extraction | Description |
---|---|---|---|---|
Sites | container |
| N/A | Any site or subsite underneath a seed. Not the same as the .aspx page for a SharePoint Site |
Lists | container |
| N/A | Any type of SharePoint list including (but not limited to): Document Libraries, External Lists, Calendars, Task Lists, etc. |
Folders | container | N/A | List Item Folders found on lists like Document Libraries or Link Lists. | |
ListItems | document | Yes | ListItems can take a number of different formats. For example, documents (PDF, doc, ppt, etc.), calendar events or announcements. For more info on how ListItems content types work, go to the MSDN article. | |
Attachments | document | Yes | A document attached to a SharePoint List Item. |
Due to API limitations, the SharePoint 2019 connector has the following limitations: