
The Box connector crawls content from a Box repository. It retrieves the supported elements through the RESTful Box Content API (version 2.0) and authenticates through the Box API, which uses OAuth 2.
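As an illustration of the kind of call involved, the sketch below builds (but does not send) a request that lists the items of a Box folder, passing an OAuth 2 access token as a bearer token. It is not the connector's own code. The function name build_folder_items_request and the placeholder token are assumptions made for this example; folder ID "0" is the root folder of a Box account.

```python
import urllib.request

# Base URL of the Box Content API, version 2.0.
BOX_API_BASE = "https://api.box.com/2.0"


def build_folder_items_request(folder_id: str, access_token: str) -> urllib.request.Request:
    """Build a GET request that lists the items of a Box folder.

    Hypothetical helper for illustration; the request is only constructed,
    never sent, so no network access or real token is needed here.
    """
    url = f"{BOX_API_BASE}/folders/{folder_id}/items"
    return urllib.request.Request(
        url,
        # OAuth 2 access tokens are sent as a bearer token.
        headers={"Authorization": f"Bearer {access_token}"},
        method="GET",
    )


# "YOUR_ACCESS_TOKEN" is a placeholder, not a real credential.
request = build_folder_items_request("0", "YOUR_ACCESS_TOKEN")
print(request.get_method(), request.full_url)
```

Sending the request (for example with urllib.request.urlopen) requires a valid access token obtained through the OAuth 2 flow.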




Some of the features of the Box connector include:

    • Ability to perform either full or incremental crawling (so that only new/updated documents are indexed)
    • Ability to exclude a folder, or a set of folders, together with its content
    • Ability to include or exclude elements (folders or files) by file name or folder name using regular expressions (regex patterns)
    • Metadata extraction
    • Search engine independent
    • Runs from any machine with access to the given Box account
    • Fetches access control lists (ACLs)
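The include/exclude filtering by name mentioned above can be sketched as follows. This is a minimal illustration, not the connector's implementation: the function filter_names is hypothetical, and the precedence rules (an exclude match always wins, an empty include list means "include everything", patterns may match anywhere in the name) are assumptions made for the example.

```python
import re
from typing import Iterable, List


def filter_names(
    names: Iterable[str],
    include_patterns: Iterable[str],
    exclude_patterns: Iterable[str],
) -> List[str]:
    """Return the names that pass the include/exclude regex filters.

    Assumed semantics (not taken from the connector):
    - a name matching any exclude pattern is dropped;
    - if include patterns are given, a name must match at least one;
    - patterns are applied with re.search, so they match anywhere in the name.
    """
    include = [re.compile(p) for p in include_patterns]
    exclude = [re.compile(p) for p in exclude_patterns]

    kept = []
    for name in names:
        if any(p.search(name) for p in exclude):
            continue
        if include and not any(p.search(name) for p in include):
            continue
        kept.append(name)
    return kept


names = ["report.docx", "draft.tmp", "archive", "notes.boxnote"]
kept = filter_names(names, include_patterns=[], exclude_patterns=[r"\.tmp$", r"^archive$"])
print(kept)
```

Anchoring a pattern with ^ and $, as in the "archive" example, restricts it to an exact folder or file name.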

Content Retrieved

The Box connector retrieves several types of documents. The inclusions and exclusions are listed below.


    • Folders
    • Folder’s collaborations
    • Files
    • Box Note
    • Bookmark
    • Google Doc
    • Google Spreadsheet
    • Word document
    • PowerPoint document
    • Excel Spreadsheet
    • File’s comments
    • File’s tasks
    • Task’s assignments
    • Users and Groups (memberships)
    • Events (for Incremental crawls)
      • ITEM_MOVE
      • ITEM_COPY
      • ITEM_TRASH
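To show how the event types listed above could drive an incremental crawl, the sketch below turns a batch of Box events into index actions. It is only an illustration under stated assumptions: the mapping from event type to action (move → update, copy → add, trash → delete) and the function plan_index_actions are hypothetical and not taken from the connector. The event_type and source fields follow the shape of Box event objects.

```python
from typing import Any, Dict, List, Tuple

# Assumed mapping from Box event types to index actions (hypothetical).
EVENT_ACTIONS = {
    "ITEM_MOVE": "update",
    "ITEM_COPY": "add",
    "ITEM_TRASH": "delete",
}


def plan_index_actions(events: List[Dict[str, Any]]) -> List[Tuple[str, str, str]]:
    """Translate events into (action, item_type, item_id) tuples.

    Events with an unhandled type, or without a source item ID, are skipped.
    """
    actions = []
    for event in events:
        action = EVENT_ACTIONS.get(event.get("event_type"))
        source = event.get("source") or {}
        if action is None or "id" not in source:
            continue
        actions.append((action, source.get("type", "unknown"), source["id"]))
    return actions


# Sample events with made-up IDs, for illustration only.
sample_events = [
    {"event_type": "ITEM_MOVE", "source": {"type": "file", "id": "101"}},
    {"event_type": "ITEM_TRASH", "source": {"type": "folder", "id": "202"}},
    {"event_type": "ITEM_PREVIEW", "source": {"type": "file", "id": "303"}},
]
planned = plan_index_actions(sample_events)
print(planned)
```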


  • Example Doc Type


Due to limitations of the Box API, the Box connector has the following limitations:

  • Example limitation
    • Example Reason

Future Development Plan 


The following features are not currently implemented, but are on the development plan:

  • Example future plan

Anything we should add? Please let us know.

