Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  
Element
Type
Default
Example
Description
useSeedsFilebooleanfalse
Allows the user to configure path to a line separated file with all the root urls that need need to be crawled.
siteCollectionsToCrawl/siteCollectionUrlStringurlString

SharePoint URL which will serve as the start of the crawl. Folders and documents underneath this URL will be automatically fetched and sent to the search engine, along with metadata and ACLs. This is the same URL you will see in your web crawler as you browse SharePointYou can add as many root urls as you want as long the user configured has the right permissions.

Note

It should not be a URL to a form or document, but the actual URL to the SharePoint object. For example instead of https://sharepoint.domain.com/Pages/home.aspx it should be https://sharepoint.domain.com.

In this version of the Aspire SharePoint 2010 Connector, the URL must be one of the following:

  • A SharePoint site collection
  • A SharePoint Site
  • A SharePoint List
seedsFilePathString${aspire.config.dir}/${app.name}/urls.txtC:\aspire\seeds.txt

The path to a txt file that contains a valid url to crawl on each line. All sites will be accessed with the same credentials:

http://www.site1.com

http://www.site2.com

...

domainString 
  •  myDomain
  • Support
  • etc.
Domain where the user will be authenticated
userString  

Username with permissions to access SharePoint content. Must have appropriate SharePoint permissions to the documents in the URL.

Note

User names should be ASCII characters

passwordString  

Password for the specified user.useExtraSiteCollectionsbooleanfalse Enables crawling of additional site collections.siteCollectionUrlstringfalse 
  • http://sharepointsite.com/sites/mySite
  • http://sharepointsite.com/sites/mySite/Shared Documents
  • http://sharepointsite.com/sites/mySite/myCollection
  • Additional site collection to crawl

    Note

    It should not be a URL to a form or document, but the actual URL to the SharePoint object. For example instead of https://sharepoint.domain.com/Pages/home.aspx it should be https://sharepoint.domain.com.

    In this version of the Aspire SharePoint 2010 Connector, the URL must be one of the following:

    • A SharePoint site collection
    • A SharePoint Site
    • A SharePoint List
    groupPrefixSeparatorString | (pipe)

    Prefix used to separate users and groups on ACL's file. For GSA users, the following characters should not be used: '@','/','\','#'; for GSA the character '|' is recommended.

    crawlAttachmentsbooleantrue
    Enable attachment crawling. Attachment files are the ones added to items inside SharePoint (e.g. documents added to calendar events or tasks).
    indexContainersbooleanfalse
    Send containers to publisher
    scanExcludedItemsbooleanfalse
    Will force scanning through excluded items to look for non excluded ones