Introduction

The SharePoint 2019 connectors will crawl content from any SharePoint 2019 site collection URL. The connector will retrieve Sites, Lists, Folders, List Items and Attachments, as well as other pages (in .aspx format).

Environment and Access Requirements

Repository Support

The SharePoint 2019 connectors supports crawling the following the repositories:

Repository	Version	Connector Version
SharePoint	2019	5.1.1

Account Privileges

The connectors offer one authentication options to access the SharePoint REST API: user account.

User Account

To configure a user crawl account, use the following, see SharePoint 2019 - Crawl Account Access.

To use a user crawl account on multiple site collections, you'll have to follow the steps on each site collection the access is needed.

Environment Requirements

The connector uses SharePoint's REST API, so the Aspire Worker nodes must have access to connect to the SharePoint 2019 environment.

Framework and Connector Features

Framework Features

Name	Supported
Content Crawling	yes
Identity Crawling	yes
Snapshot-based Incrementals	yes
Non-snapshot-based Incrementals	yes
Document Hierarchy	yes

Connector Features

The SharePoint 2019 connector has the following features:

Item filtering using include and exclude regex patterns. This is based on the item's display URL.
Access Control Lists (ACLs) fetching, for document level security.
Support for BCS external lists.
Performs non-snapshot-based incremental crawling (so that only new/updated documents are indexed) using SharePoint's change log tokens.

Content Crawled

The SharePoint Online connector can crawl the following objects:

Name	Type	Relevant Metadata	Content Fetch & Extraction	Description
Sites	container	Last Modified Date	N/A	Any site or subsite underneath a seed. Not the same as the .aspx page for a SharePoint Site
Lists	container	Last Modified Date Data size	N/A	Any type of SharePoint list including (but not limited to): Document Libraries, External Lists, Calendars, Task Lists, etc.
Folders	container		N/A	List Item Folders found on lists like Document Libraries or Link Lists.
ListItems	document		Yes	ListItems can take a number of different formats. For example, documents (PDF, doc, ppt, etc.), calendar events or announcements. For more info on how ListItems content types work, go to the MSDN article.
Attachments	document		Yes	A document attached to a SharePoint List Item.

Limitations

Due to API limitations, the SharePoint 2019 connector has the following limitations:

The connector uses the REST API to access SharePoint databases directly; it doesn't use web crawling.
Crawling is only supported using a Site or a List as a root URL.

Page tree

Introduction

Environment and Access Requirements

Repository Support

Account Privileges

User Account

Environment Requirements

Framework and Connector Features

Framework Features

Connector Features

Content Crawled

Limitations

Contact Us: [email protected]

Page tree

SharePoint 2019 - Features

Introduction

Environment and Access Requirements

Repository Support

Account Privileges

User Account

Environment Requirements

Framework and Connector Features

Framework Features

Connector Features

Content Crawled

Limitations

Contact Us: [email protected]