The OneDrive Connector will crawl content from a Microsoft One Drive repository.


Introduction


OneDrive is Microsoft's cloud service which links you to all your files. It allows you to store and protect your files, share them with other people, and access them from anywhere on all your devices. For more information about OneDrive, see the official Microsoft Overview of OneDrive in Microsoft 365 documentation.

Environment and Access Requirements


Repository Support

The OneDrive connector supports crawling the following the repositories:

RepositoryVersionConnector Version
OneDriveAll5.1

User Account Requirements

To access OneDrive, a user account with sufficient privileges must be supplied.

Azure AD Application

To access OneDrive APIs (Microsoft Graph), you'll need to create an Azure AD Application. 

Create the Azure AD application

  1. See Microsoft's Use portal to create an Azure Active Directory application and service principal that can access resources for the steps on how to create an Application ID properly and its key (Client Secret). Make sure to write down your Application Key at the time of creation. It will not be shown again after you exit the portal. 

Assign permissions to the application

  1. Log into the Azure Management Portal.
  2. Click on the “Azure Active Directory” option.
  3. Select the “App registrations” option and then select your client application.
  4. Select "API permissions" > "Add a permission" > "Microsoft APIs".

  5. Select the “Microsoft Graph” option.
  6. On Application Permissions, select the following:
    • Read files in all site collections (Files.Read.All)
    • Read and write files in all site collections (Files.ReadWrite.All)
    • Read all users’ full profiles (User.Read.All)
    • Read directory data (Directory.Read.All)
    • Read all groups (Group.Read.All)
    • Read and write items in all site collections (Sites.ReadWrite.All)
    • Read items in all site collections (Sites.Read.All)
  7. Click on “Save”.
  8. Click on “Grant admin consent for . . . ”
    • Click on “Yes” when prompted


Framework and Connector Features


Framework Features

Name Supported
Content CrawlingYes
Identity CrawlingYes
Snapshot-based IncrementalsYes
Non-snapshot-based IncrementalsNo
Document HierarchyYes

Connector Features

The OneDrive connector has the following features:

  • Performs incremental crawling (so that only new/updated documents are indexed), using OneDrive Delta Token
  • Possibility to include or exclude an item or a set of items and their content using regex patterns
  • Metadata extraction
  • Is search engine independent
  • Runs from any machine with access to the given OneDrive repository


Content Crawled


The OneDrive connector can crawl the following objects:

NameTypeRelevant MetadataContent Fetch and ExtractionDescription
Drivecontainer
N/AContains Folders and files
Foldercontainer
N/AThe directories of the files. Each directory will be scanned to retrieve more subfolders or documents.
Filedocument
YesFiles stored in folders/subfolders


Disable checkingPremium suggestionsDisable checkingPremium suggestionsDisable checkingPremium suggestionsDisable checkingPremium suggestionsDisable checkingPremium suggestions

  • No labels