Available Connectors

The table below lists the connectors available for Aspire 4.0 and Aspire 5.0.

Repository Type	Connector	Description	Aspire 4.0	Aspire 5.0
File System	File System	Extracts documents from a locally accessible File System path	✓	✓
	SMB	Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol.	✓	✓
	FTP	Extracts documents from remote servers using the File Transfer Protocol (FTP)	✓	☐
	Amazon S3	Extracts documents from S3 buckets in any region on AWS	✓	✓
	Box.com	Extracts documents from Box.com	✓	☐
	HDFS	Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS	✓	☐
	OneDrive	Extracts documents from Microsoft OneDrive accounts	✓	☐
	Azure Data Lake	Extracts documents from Microsoft Azure Data Lake Store cloud	✓	☐
	Azure Blob Storage	Extracts documents from Microsoft Azure Blob Storage service	✓	☐
	Azure File Storage	Extracts documents from Microsoft Azure File Storage service	✓	☐
Events, Messaging and Streaming	Azure Events Hub	Extracts events from Microsoft Azure Events Hub service	✓	☐
	Apache Kafka	Extracts events from Apache Kafka event streaming platform	✓	☐
	Amazon Kinesis	Extracts records from Amazon Kinesis Data Streams	✓	☐
	RSS	Extracts items from RSS feeds	✓	☐
Relational Databases	RDB via Table	Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries.	✓	✓
	RDB via Snapshots	Extracts content from Relational Database SQL queries, and performs incremental updates using a content digest Snapshot table	✓	✓
	Database Server	Scans all databases within a server, extracts table information from all databases and extracts rows from all tables.	✓	February 2022
Content Management Systems	Documentum	Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum	✓	☐
	Documentum DQL	Extracts documents using the DQL query language for full and incremental crawls. ACL extraction is also expressed as DQL statements.	✓	☐
	SharePoint 2013	Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments)	✓	☐
	SharePoint 2016	Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments)	✓	☐
	SharePoint 2019	Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments)	☐	☐
	SharePoint Online	Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments)	✓	✓
Collaboration	Atlassian Confluence	Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and comments	✓	☐
	IBM Connections	Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities	✓	☐
	Atlassian Jira	Extracts content from different Jira issue types (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.)	✓	☐
	Salesforce	Extracts content from Salesforce including Accounts, Campaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments.	✓	☐
	ServiceNow	Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items	✓	☐
	Adobe Experience Manager (AEM)	Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objects	✓	☐
CRM	RightNow	Extracts content from a RightNow instance including Answers, Attachments and Incidents	✓	☐
Web Crawler	Aspider Web Crawler	Extracts pages and documents from websites by following links inside HTML pages. Supports static websites and multiple authentication mechanisms.	✓	February 2022
	Selenium Crawler	Extracts pages and documents from websites by following links inside HTML pages. Supports dynamic websites by using the Selenium framework to render pages in real browser instances. Crawling can be customized by scripting behaviors in the browser.	✓	February 2022
Social Networks	Jive	Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders.	✓	☐
	Twitter	Extracts tweets and metadata from any Twitter account, including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities and Retweet count	✓	☐
	Yammer	Extracts content from Yammer messages by Group, Thread and Topic.	✓	☐
NoSQL Database	HBase	Extracts content stored in the objectData field of the tables in an HBase server.	✓	☐
	Elasticsearch	Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract.	✓	February 2022
Identity Providers	Group Expansion	Given a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user.	✓	✓
	LDAP	Retrieves users, groups and memberships from any LDAP server	✓	☐
	Atlassian Confluence Identities	Retrieves users, groups and memberships stored in Confluence repositories.	✓	☐
	Azure Active Directory	Retrieves users, groups and memberships from Azure Active Directory.	✓	✓
Other	MS Exchange	Extracts content from Exchange servers, including Mail (and attachments), Calendar and Contacts	☐	☐
	REST Connector		☐	✓
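
The Group Expansion entry above describes computing the complete set of group memberships for a user from direct memberships. The idea can be illustrated with a minimal sketch, which is not Aspire's implementation: the function name `expand_groups`, the dictionary layout and the sample data are all assumptions made for illustration, and the traversal is written iteratively rather than recursively so that cyclic group nesting cannot cause infinite recursion.

```python
from typing import Dict, Set


def expand_groups(direct: Dict[str, Set[str]], principal: str) -> Set[str]:
    """Return every group `principal` belongs to, directly or via nested groups.

    `direct` maps a user or group name to the groups it is a direct member of.
    This layout is a hypothetical one chosen for the example.
    """
    seen: Set[str] = set()
    stack = list(direct.get(principal, ()))
    while stack:
        group = stack.pop()
        if group in seen:
            continue  # already expanded; this also guards against cycles
        seen.add(group)
        # A group may itself be a member of other groups: expand those too.
        stack.extend(direct.get(group, ()))
    return seen


# Hypothetical sample data: users and groups share one namespace here.
memberships = {
    "alice": {"engineering"},
    "engineering": {"staff"},
    "staff": {"everyone"},
    "everyone": {"staff"},  # deliberate cycle between staff and everyone
    "bob": {"everyone"},
}

alice_groups = expand_groups(memberships, "alice")
bob_groups = expand_groups(memberships, "bob")
```

With this sample data, `alice_groups` contains `engineering`, `staff` and `everyone`, even though only `engineering` is a direct membership.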
