The Azure Blob Storage connector crawls content from an Azure Blob container repository.
Microsoft Azure Blob storage is Microsoft's object storage solution for the cloud. Blob storage is optimized for storing massive amounts of unstructured data, such as text or binary data.
For more information about Azure Blob storage, see the official Microsoft Azure Blob Storage documentation.
The Azure Blob Storage connector supports crawling the following repositories:
Repository | Version | Connector Version |
---|---|---|
Azure Blob Storage | All | 5.1 |
To access Azure Blob storage, the connector must establish a connection to a valid Azure storage account.
Microsoft Azure storage is a service that is independent of Accenture Aspire technologies and licenses. To set one up, see Microsoft's "Create a storage account" documentation.
User Account Requirements
To access the Azure Blob Container, a connection string must be supplied. See Microsoft's Manage Storage Account Access Keys documentation for the steps on how to get the connection string.
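Before configuring the connector, it can help to sanity-check the connection string copied from the Azure portal. The following is a minimal sketch, not part of the connector: it assumes an account-key style string of the form `Key=Value;Key=Value`, and the function name `parse_connection_string` and the `REQUIRED_KEYS` tuple are hypothetical names chosen for illustration.

```python
# Assumption: an account-key connection string, e.g.
# DefaultEndpointsProtocol=https;AccountName=...;AccountKey=...;EndpointSuffix=core.windows.net
REQUIRED_KEYS = ("AccountName", "AccountKey")  # hypothetical minimum for this sketch


def parse_connection_string(conn_str: str) -> dict[str, str]:
    """Split a 'Key=Value;Key=Value' connection string into a dict."""
    settings: dict[str, str] = {}
    for index, segment in enumerate(conn_str.split(";")):
        segment = segment.strip()
        if not segment:
            continue  # tolerate a trailing semicolon
        # Split on the first '=' only: Base64 account keys often end in '='.
        key, sep, value = segment.partition("=")
        if not sep or not key:
            # Report the position, not the text, so secrets are never echoed.
            raise ValueError(f"Malformed segment at position {index}")
        settings[key] = value
    missing = [k for k in REQUIRED_KEYS if not settings.get(k)]
    if missing:
        raise ValueError(f"Connection string is missing: {', '.join(missing)}")
    return settings
```

A check like this catches truncated copies (for example, a key cut off at a semicolon) before the crawl is started. It does not verify that the account or key is actually valid.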
The Azure Blob Storage connector has the following features:
Name | Supported |
---|---|
Content Crawling | Yes |
Identity Crawling | Use Azure Identity Connector |
Snapshot-based Incrementals | Yes |
Non-snapshot-based Incrementals | No |
Document Hierarchy | Yes |
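The idea behind snapshot-based incrementals, listed as supported among the connector's features, can be illustrated with a small conceptual sketch. This is not the connector's implementation. It assumes each crawl records a snapshot that maps a blob name to some change signature (for example, the blob's ETag), and the function name `diff_snapshots` is hypothetical.

```python
def diff_snapshots(
    previous: dict[str, str], current: dict[str, str]
) -> dict[str, list[str]]:
    """Compare two {blob_name: signature} snapshots and classify the changes."""
    prev_names = previous.keys()
    curr_names = current.keys()
    return {
        # Present now, absent in the last crawl.
        "added": sorted(curr_names - prev_names),
        # Present in both crawls, but the signature changed.
        "updated": sorted(
            name for name in curr_names & prev_names
            if current[name] != previous[name]
        ),
        # Present in the last crawl, absent now.
        "deleted": sorted(prev_names - curr_names),
    }
```

Under these assumptions, an incremental crawl only needs to process the blobs reported as added, updated, or deleted, rather than refetching every blob in the container.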
The Azure Blob Storage connector can crawl the following objects:
Name | Type | Relevant Metadata | Content Fetch and Extraction | Description |
---|---|---|---|---|
Container | container | | N/A | Organizes a set of blobs, similar to a directory in a file system |
Blob | document | | Yes | Stores text and binary data |
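Azure Blob storage uses a flat namespace in which a `/` in a blob name conventionally acts as a virtual directory separator. The sketch below shows one way containers and blobs could be arranged into a hierarchy on that basis. It is illustrative only and makes no claim about how the connector builds its document hierarchy; the function name `group_by_parent` and the sample names are hypothetical.

```python
from collections import defaultdict


def group_by_parent(container: str, blob_names: list[str]) -> dict[str, list[str]]:
    """Group blob leaf names under 'container/virtual/folder' parent paths."""
    groups: dict[str, list[str]] = defaultdict(list)
    for name in blob_names:
        # Everything before the last '/' is treated as the virtual folder.
        parent, _, leaf = name.rpartition("/")
        key = f"{container}/{parent}" if parent else container
        groups[key].append(leaf)
    return dict(groups)


if __name__ == "__main__":
    sample = ["readme.txt", "reports/2024/q1.pdf", "reports/2024/q2.pdf"]
    for parent, leaves in group_by_parent("docs", sample).items():
        print(parent, leaves)
```

Blobs without a `/` in their name sit directly under the container, which matches the description of a container as being similar to a directory in a file system.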