The File System connector will crawl content from the file system location.

Introduction

The File System connector can scan and fetch the directories and documents of a file system.

Environment and Access Requirements

Repository Support

The File System supports crawling the following the repositories

Repository	Version	Connector Version
Windows	All	5.0
Linux	All	5.0

This component has been officially tested on local Windows and Linux.

Account Privileges

For the File System connector to be able to crawl the Aspire Worker nodes must be run with an account with read access to the directory to crawl.

Environment Requirements

The Aspire Worker nodes must be running in the same machine where the file system repository is located or at least have access in the case of a file share.

Framework and Connector Features

Framework Features

Name	Supported
Content Crawling	yes
Identity Crawling	no
Snapshot-based Incrementals	yes
Non-snapshot-based Incrementals	no
Document Hierarchy	yes

Connector Features

The File System connector has the following features:

Document filtering using include and exclude regex patterns.
Static acls can be added for the documents crawled

Content Crawled

The File System connector is able to crawl the following objects:

Name	Type	Relevant Metadata	Content Fetch & Extraction	Description
Folder	container	Last Modified Date	NA	The directories of the file system. Each directory will be scanned to retrieve more directories or files
File	document	Last Modified Date Data size	yes	The files contained by the directories in the crawled file system.

Limitations

The File System Connector has the following limitations:

The connector does not retrieve the acls of the crawled documents.