The IBM Connections Connector performs full and incremental scans over a IBM Connections site and will extract security, metadata, and content from each object scanned. The connector allows you to select whether you wish all applications or only some of them (Activities, Blogs, Files, etc). Each scanned object will be tagged with one of three possible actions--add, update, or delete--and can be routed to any Aspire pipeline as desired.

The connector, once started, can be stopped, paused or resumed via the Scheduler Component. Typically the start job will contain all information required by the job to perform the scan. When pausing or stopping, the connector will wait until all the jobs it published have completed before updating the statistics and status of the connector.

IBM Connections Application Bundle
AppBundle Name	IBM Connectios Connector
Maven Coordinates	com.searchtechnologies.aspire:aspire-ibmconnections-source
Version	3.1
Inputs	AspireObject from a content source submitter holding all the information required for a crawl
Outputs	An AspireObject containing the URL, content, ACLs and Metadata processed for each file.

Configuration

This section lists all configuration parameters available to install the IBM Connections Application Bundle and to execute crawls using the connector.

Property	Type	Default	Description
IBMServer	string	none	The Url of the IBM Connection server to crawl (you have to specify the protocol).
IBMUser	string	none	The Username to connect with.
IBMPassword	string	none	The password of the Username to connect with.
Page Size	integer	100	Specifies the number of entries per page to return in the crawl
useLTPA	boolean	false	true if the connector is going to use LTPA token for authentication.
IBMLoginUrl	string	none	IBM Connections login page (contains the LTPA token)
extractACL	boolean	false	true if the connector is going to requires user's GUID from LDAP server
ldapComponent	string	none	LDAP Cache component that was configure in the Services Section.
crawlAllApps	boolean	false	false, All the default endpoints (applications: Activities, Blogs, Bookmarks, Communities, Files, Forums, Profiles, and Wikis) will be crawl.The user should select the applications that want to crawl. true, The user will select which application wants to crawl.
withLimitedAccess	boolean	false	true if the connector is going to hace have limited access to the network or internet
withPatterns	boolean	false	true, if the user defined the accessible servers using patterns false, if the user defined the accessible server's names or ips


addRequestProperty	List<String>		Specifies the header and the value of the Request Property or Properties
geTPSize	integer	5	Number of threads for the thread pool to download users and groups from IBM server.
shouldBackoff	boolean	false	If true, the connector will have a back off re-connection method when the server returns the specified error.
backoffErrorPattern	regex	false	Indicate the regex to match the error message to back-off.
backoffMinutes	integer	15	Time to wait when a back-off error is encountered.
backoffRetries	integer	3	Number of retries with backoff when error is encountered.
dateFormat	String		Format to parse the LastModifiedDate and Publish Date.
expandACL	boolean	false	If true, the containers' ACL (Communities, Forums, Activities) will be expanded during crawling.

Configuration Example

To install the application bundle, add the configuration, as follows, to the <autoStart> section of the Aspire settings.xml.

Panel

title	Configuration

<application config="com.searchtechnologies.aspire:app-ibmconnections-connector">
<properties>
<property name="generalConfiguration">false</property>
<property name="snapshotDir">${dist.data.dir}/${app.name}/scannerTimestamps</property>
<property name="disableTextExtract">false</property>
<property name="extractTextMaxSize">unlimited</property>
<property name="extractTimeout">180000</property>
<property name="workflowReloadPeriod">15s</property>
<property name="workflowErrorTolerant">false</property>
<property name="non-text-document">true</property>
<property name="nonTextDocumentsExtensions">jpg,gif,png,tif,mp3,mp4,mpg,mpeg,avi,mkv,wav,bmp,swf,jar,war,rar,zip,tgz,dll,exe,class</property>
<property name="enableFetchUrl">true</property>
<property name="fullRecovery">incremental</property>
<property name="incrementalRecovery">incremental</property>
<property name="waitForSubJobs">600000</property>
<property name="maxThreads">10</property>
<property name="jobQueue">30</property>
<property name="enableAuditing">true</property>
<property name="emitStartJob">true</property>
<property name="emitEndJob">true</property>
<property name="debug">false</property>
<property name="batchSize">50</property>
<property name="batchTimeout">60000</property>
<property name="fdServiceUrl"/>
</properties>

</application>

Page tree

Versions Compared

Old Version 7

New Version Current

Key

Configuration

Configuration Example

Page tree

Page History

Versions Compared

Old Version 7

New Version Current

Key

Configuration

Configuration Example