Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Panel
titleOn this page

Table of Contents

Introduction


The SharePoint Online connector will crawl content from any SharePoint Online site collection URL. The connector will retrieve Sites, Lists, Folders, List Items and Attachments, as well as other pages (in .aspx format). This connector supports SharePoint running in the Microsoft 365 offering.

This is not a O365 connector, the individual repository offerings within O365, such as OneDrive, Calendar, Tasks, Yammer will have their own connectors.

Environment and Access Requirements


Repository Support

The File System supports crawling the following the repositories

RepositoryVersionConnector Version
SharePointMicrosoft 3655.0
Info

This component has been officially tested on local Windows and Linux.

Account Privileges

For the File System connector to be able to crawl the Aspire Worker nodes must be run with an account with read access to the directory to crawl.

Environment Requirements

The Aspire Worker nodes must be running in the same machine where the file system repository is located or at least have access in the case of a file share. 

Framework and Connector Features


Framework Features

NameSupported
Content Crawlingyes
Identity Crawlingno
Snapshot-based Incrementalsyes
Non-snapshot-based Incrementalsno
Document Hierarchyyes

Connector Features

The File System connector has the following features:

  • Document filtering using include and exclude regex patterns.
  • Static acls can be added the documents crawled

Content Crawled


The File System connector is able to crawl the following objects:

NameType Relevant MetadataContent Fetch & ExtractionDescription
Foldercontainer
  • Last Modified Date
NAThe directories of the file system. Each directory will be scanned to retrieve more directories or files
Filedocument
  • Last Modified Date
  • Data size
yesThe files contained by the directories in the crawled file system.

Limitations


The File System Connector has the following limitations:

  • The connector does not retrieve the acls of the crawled documents.