Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.

Box connector will crawl content from Box repository. The connector will retrieve the supported elements using the RESTful API (Content API Basics 2.0 version), for authentication will use Box API (that uses OAuth 2).


On this page:

Table of Contents


Some of the features of the Box connector include:

    • Ability to perform either full or incremental crawling (so that only new/updated documents are indexed)
    • Possibility of exclude a folder or a set of folders and their content.
    • Possibility of exclude or include elements (folders or files) by file name or folder name using regular expression (regex patterns)
    • Metadata extraction
    • it is search engine independent
    • Runs from any machine with access to the given Box account
    • Fetches access control lists (ACLs)

Content Retrieved

The Box connector retries several types of documents, listed bellow are the inclusions and exclusions of these documents.


    • Folders
    • Folder’s collaborations
    • Files
    • Box Note
    • Bookmark
    • Google Doc
    • Google Spreadsheet
    • Word document
    • PowerPoint document
    • Excel Spreadsheet
    • File’s comments
    • File’s tasks
    • Task’s assignments
    • Users and Groups (memberships)
    • Events (for Incremental crawls)
      • ITEM_MOVE
      • ITEM_COPY
      • ITEM_TRASH


  • Example Doc Type


Due to API limitations, Box connector has the following limitations:

  • Example limitationExample ReasonBox connector crawls only the latest version of files.
  • Box connector does not crawls any Trash items (folders or files)

    Incremental limitations
  1. When there is change on a Box Note will not reflect in the incremental crawl.

Future Development Plan 


The following features are not currently implemented, but are on the development plan:

  • Example future plan

Anything we should add? Please let us know.