SharePoint Architecture
Find detailed information on MSDN article.
Summary of SharePoint organization
This is the hierarchy of processes/applications/sites/sub-sites/libraries/folders/and documents within SharePoint.
- SharePoint Server
- SharePoint Web Application Pool
- SharePoint Web Application (single web application)
- Main Site Collection (the primary or main site created for the web application, associated with the primary http://xyz.server.com URL)
- Sub Sites
- Document Libraries
- Folders
- Documents
- Attachments
- Documents
- Folders
- Document Libraries
- Sub Sites
- Other Site Collections
- Sub Sites
- Document Libraries
- Folders
- Documents
- Attachments
- Documents
- Folders
- Document Libraries
- Sub Sites
- Main Site Collection (the primary or main site created for the web application, associated with the primary http://xyz.server.com URL)
- SharePoint Web Application (single web application)
- SharePoint Web Application Pool
Content Retrieved by the Connector
The SharePoint connector will retrieve the following objects:
- Sites
- Lists
- External Lists (BCS)
- Folders
- Documents or List Items
- Attachments
ListItems can take a number of different formats. For example, documents (pdf, doc, ppt, etc), calendar events or announcements. For more info on how ListItems content types work go to the MSDN article
Operation Mode
The connector uses the REST API over HTTP or HTTPs to acquire information of SharePoint Online content.
The connector acquires content by doing the following:
- Go recursively through all sites, subsites, lists, folders and documents and creates sub-jobs for each object discovered. Each sub-job contains all metadata available, including ACLs.
- Saves a snapshot file to compare previous item states and do incremental crawls with added, updated and deleted items. This snapshot file also contains the last saved SharePoint change log timestamp which is used on the next incremental crawl to get only modified items.
Overview
Content Tools