The File System connector will crawl content from the file system location.


Introduction

The File System connector can scan and fetch the directories and documents of a file system.

The Azure Identity connector will crawl Azure identities (users and groups) from the specified Azure Active Directory and store them in an identity cache.

Environment and Access Requirements

Repository Support

The File System connector supports crawling the following repositories:

Repository	Version	Connector Version
Windows	All	5.0
Linux	All	5.0
Info
This component has been officially tested on local Windows and Linux.

Account Privileges

For the File System connector to be able to crawl, the Aspire Worker nodes must be run with an account that has read access to the directory to crawl.

For the Azure Identity connector to be able to crawl the identities from the Azure AD that corresponds to the domain specified in the connection, the specified credentials must have sufficient permissions to read those identities from the directory.

MS Graph Application permissions needed:

GroupMember.Read.All / Application
User.Read.All / Application
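
As an illustration of the permission requirement above, the following is a minimal sketch that checks whether an already decoded app-only access token carries both application permissions. It assumes the token claims are available as a dictionary and that granted application permissions appear in the `roles` claim, as they do in app-only tokens issued by the Microsoft identity platform. The function name `missing_graph_permissions` is hypothetical and not part of the connector.

```python
# Application permissions listed in this section.
REQUIRED_ROLES = {"GroupMember.Read.All", "User.Read.All"}


def missing_graph_permissions(token_claims: dict) -> set:
    """Return the required application permissions absent from the token.

    Assumption: `token_claims` is the decoded payload of an app-only
    access token, where granted application permissions are listed
    under the "roles" claim.
    """
    granted = set(token_claims.get("roles", []))
    return REQUIRED_ROLES - granted
```

An empty result means both permissions are present. Anything else names the permission that still has to be granted before the crawl can read the identities.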

Other Requirements

For the File System connector, the Aspire Worker nodes must be running on the same machine where the file system repository is located, or at least have access to it in the case of a file share.

For the Azure Identity connector, the Aspire Worker nodes must be able to reach the Azure AD specified in the connection.


Framework and Connector Features

Framework Features

Name	File System	Azure Identity
Content Crawling	yes	no
Identity Crawling	no	yes
Snapshot-based Incrementals	yes	no
Non-snapshot-based Incrementals	no	yes
Document Hierarchy	yes	no

Connector Features

The File System connector has the following features:

Document filtering using include and exclude regex patterns.
Static ACLs can be added to the crawled documents.
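
The include and exclude filtering mentioned above can be sketched as follows. This is only an illustration under stated assumptions: the source does not say how the patterns are matched or which list takes precedence, so the sketch assumes a path is kept when it matches at least one include pattern (or no include patterns are configured) and matches no exclude pattern, using a substring-style `re.search`. The function name `filter_paths` is hypothetical.

```python
import re
from typing import Iterable, List


def filter_paths(paths: Iterable[str],
                 include: Iterable[str],
                 exclude: Iterable[str]) -> List[str]:
    """Keep paths that pass the include patterns and fail every exclude pattern.

    Assumed semantics (not confirmed by the connector documentation):
    an empty include list accepts everything, and exclude wins over include.
    """
    include_res = [re.compile(p) for p in include]
    exclude_res = [re.compile(p) for p in exclude]

    kept = []
    for path in paths:
        included = not include_res or any(r.search(path) for r in include_res)
        excluded = any(r.search(path) for r in exclude_res)
        if included and not excluded:
            kept.append(path)
    return kept
```

For example, with the include pattern `^/data/` and the exclude pattern `\.tmp$`, a file under `/data` is kept unless its name ends in `.tmp`.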

The Azure Identity connector has the following features:

Lower casing of retrieved identities.
Adding a special "Everyone" group.
Filtering out external groups.
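
The three identity options above can be illustrated with a small sketch. The data model here is an assumption made for the example: users and groups are plain dictionaries, and an external group is marked with a hypothetical `external` flag. The source does not describe how the connector detects external groups or how it stores identities, so this is not the connector's implementation.

```python
from typing import Dict, List, Tuple


def normalize_identities(users: List[Dict], groups: List[Dict], *,
                         lowercase: bool = True,
                         add_everyone: bool = True,
                         drop_external: bool = True) -> Tuple[List[Dict], List[Dict]]:
    """Apply lower casing, an "Everyone" group, and external-group filtering.

    Assumed shapes: users are {"name": str, "groups": [str]} and
    groups are {"name": str, "external": bool}.
    """
    fix = str.lower if lowercase else (lambda s: s)
    everyone = fix("Everyone")

    # Drop groups flagged as external when the option is enabled.
    kept = [g for g in groups if not (drop_external and g.get("external", False))]
    kept_names = {fix(g["name"]) for g in kept}
    out_groups = [{"name": fix(g["name"])} for g in kept]

    out_users = []
    for user in users:
        # Keep only memberships in groups that survived the filter.
        memberships = [fix(n) for n in user.get("groups", []) if fix(n) in kept_names]
        if add_everyone:
            memberships.append(everyone)
        out_users.append({"name": fix(user["name"]), "groups": memberships})

    if add_everyone:
        out_groups.append({"name": everyone})
    return out_users, out_groups
```

In this sketch the "Everyone" group name is lower cased along with the other identities when lower casing is enabled, which is a design choice of the example rather than documented behavior.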

Crawled Identities

The Azure Identity connector is able to crawl the following objects:

Name	Type	Metadata	Content Fetch & Extraction	Description
User	document	Name, Domain, Groups, Attributes	no	The users of the Azure AD.
Group	document	Name, Domain, Attributes	yes	The groups of the Azure AD.

Info
Attributes for users typically contain the user GUID but may contain additional data.
Attributes for groups typically contain their display name but may contain additional data.

Content Crawled

The File System connector is able to crawl the following objects:

Name	Type	Metadata	Content Fetch & Extraction	Description
Folder	container	Last Modified Date	N/A	The directories of the file system. Each directory is scanned to retrieve more directories or files.
File	document	Last Modified Date, Data size	yes	The files contained in the directories of the crawled file system.

Limitations

The File System connector has the following limitations:

The connector does not retrieve the ACLs of the crawled documents.
