Available Connectors

The connectors available for Aspire 4.0 and 5.0 are listed below.


Repository Type, Connector, Description, Released Version (Aspire 4.x, Aspire 5.x)

File System

File System

Extracts documents from a locally accessible file system path

5.0

SMB

Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol.

TBD

FTP

Extracts documents from remote servers using the File Transfer Protocol (FTP)
TBD

Extracts documents from S3 buckets in any region on AWS
TBD

Box.com

Extracts documents from Box.com
TBD

HDFS

Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS

TBD

OneDrive

Extracts documents from Microsoft OneDrive accounts
TBD

Extracts documents from Microsoft Azure Data Lake Store cloud
TBD

Extracts documents from Microsoft Azure Blob Storage service

TBD

Extracts documents from Microsoft Azure File Storage service
TBD

Events, Messaging and Streaming

Extracts events from the Microsoft Azure Event Hubs service
TBD

Extracts events from Apache Kafka event streaming platform
TBD

Extracts records from Amazon Kinesis Data Streams

TBD

RSS

Extracts items from RSS feeds
TBD

Relational Databases

RDB via Table

Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries.
TBD

RDB via Snapshots

Extracts content from Relational Database SQL queries, and performs incremental updates by using a content digest Snapshot table.

TBD

Database Server

Scans all databases within a server, extracts table information from all databases, and extracts rows from all tables.
TBD
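
The snapshot-based incremental update described for RDB via Snapshots can be illustrated with a minimal sketch. This is not Aspire's implementation: the function names `digest` and `diff_against_snapshot`, the in-memory dictionaries standing in for the snapshot table, and the choice of SHA-256 are all assumptions made for illustration. The idea is to store one content digest per row and compare it with the digest computed on the next crawl.

```python
import hashlib


def digest(row: dict) -> str:
    """Content digest of a row (hypothetical helper): hash the sorted column/value pairs."""
    payload = "\x1f".join(f"{key}={row[key]}" for key in sorted(row))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def diff_against_snapshot(snapshot: dict, rows: dict):
    """Compare current rows (id -> row) with the previous snapshot (id -> digest).

    Returns (adds, updates, deletes, new_snapshot). The dictionaries stand in
    for a snapshot table; a real connector would persist them in the database.
    """
    new_snapshot = {row_id: digest(row) for row_id, row in rows.items()}
    adds = sorted(i for i in new_snapshot if i not in snapshot)
    updates = sorted(
        i for i in new_snapshot if i in snapshot and snapshot[i] != new_snapshot[i]
    )
    deletes = sorted(i for i in snapshot if i not in new_snapshot)
    return adds, updates, deletes, new_snapshot
```

On a first crawl the snapshot is empty, so every row is reported as an add. Later crawls report only rows whose digest changed, plus rows that disappeared.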

Content Management Systems

Documentum

Extracts documents stored in docbases, cabinets, folders, and sub-folders within Documentum
TBD

Documentum DQL

Extracts documents using the DQL query language for full and incremental crawls. ACL extraction is also expressed as DQL statements.
TBD

SharePoint 2010

Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments)
TBD

The Dropbox connector can crawl pages, folders, and files from a Dropbox repository. It performs identity crawling, supports snapshot-based incremental crawls, and respects the document hierarchy.


Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments)
TBD

SharePoint 2016

Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments)
TBD

SharePoint 2019

Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments)
TBD

Lotus Notes

Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments)
TBD

SharePoint Online

Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments)
5.0

Collaboration

Atlassian Confluence

Extracts documents from Confluence repositories, including spaces, blogs, pages, attachments, and comments
TBD

eRoom

Extracts documents from an eRoom server instance (site) using the XML Query feature
TBD

IBM Connections

Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles, and Communities
TBD

Socialcast

Extracts content from any Socialcast Community server including messages, comments, attachments, conversations, polls, users, groups, streams, likes, flags, and badges
TBD

TeamForge

Extracts documents from TeamForge including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers, and wiki services
TBD

Atlassian Jira

Extracts content from different Jira issue types (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.)
TBD

Extracts content from Salesforce including Accounts, Campaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments.
TBD

ServiceNow

Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users, and Catalog Items
TBD

Subversion

Extracts files from a Subversion instance by crawling the head of the repository
TBD

Extracts content from an Adobe Experience Manager (AEM) server, including all page and asset objects

Veeva Vault

Extracts content from Veeva Vault using a Vault Query Language (VQL) statement.

Aspire 4.x: ☐   Aspire 5.x: ✓

Kinesis

Fetches data from Amazon Kinesis Data Streams.

Aspire 4.x: ✓   Aspire 5.x: ☐

TBD

CRM

RightNow

Extracts content from a RightNow instance including Answers, Attachments, and Incidents
TBD

Web Crawler

Extracts pages and documents from websites by following links inside HTML pages. Static websites are supported, along with multiple authentication mechanisms.
TBD

Selenium Crawler

Extracts pages and documents from websites by following links inside HTML pages. Dynamic websites are supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser.
TBD
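
As a rough illustration of "following links inside HTML pages", the sketch below extracts the absolute HTTP(S) links from one page using only the Python standard library. It is not how the Aspire crawlers are implemented. The class `LinkExtractor` and the function `extract_links` are hypothetical names, and a real crawler would add fetching, scope rules, authentication, and a visited-URL queue on top of this step.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkExtractor(HTMLParser):
    """Collects absolute http(s) links from <a href="..."> tags (hypothetical helper)."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page URL and drop #fragments.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                is_web_link = absolute.startswith(("http://", "https://"))
                if is_web_link and absolute not in self.links:
                    self.links.append(absolute)


def extract_links(base_url: str, html: str) -> list:
    """Return the de-duplicated links found in html, in document order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links
```

A crawler would call `extract_links` on each fetched page and enqueue the results that fall inside the configured crawl scope.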

Social Networks

Jive

Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs, and any sub-folders.
TBD

Twitter

Extracts tweets and metadata from any Twitter account, including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, and Retweet count
TBD

Yammer

Extracts content from Yammer messages by Group, Thread, and Topic.
TBD

NoSQL Database

HBase

Extracts content stored in the objectData field of the tables in an HBase server.
TBD

Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract.
TBD

Identity Providers

Group Expansion
Given a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user.
5.0
The Group Expansion connector can crawl and expand identities from the Identity Cache. 

LDAP Identity
LDAP
Retrieves users, groups, and memberships from any LDAP server
TBD

Atlassian Confluence Identities
Retrieves users, groups, and memberships stored in Confluence repositories.
TBD

Azure Active Directory
Retrieves users, groups, and memberships from Azure Active Directory.
TBD
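
The recursive expansion described for the Group Expansion connector can be sketched as a graph traversal. This is only an illustration under stated assumptions, not the connector's actual code: `expand_groups` is a hypothetical function, and the `direct` mapping (principal or group name to the groups it belongs to directly) stands in for whatever the Identity Cache stores. The sketch uses an explicit stack instead of recursion so that deeply nested or cyclic memberships do not cause problems.

```python
def expand_groups(direct: dict, principal: str) -> set:
    """Return every group the principal belongs to, directly or through nested groups.

    `direct` maps a user or group name to the set of groups it is a direct
    member of (an assumed in-memory stand-in for stored identity data).
    """
    expanded = set()
    stack = list(direct.get(principal, ()))
    while stack:
        group = stack.pop()
        if group in expanded:
            continue  # already visited; this also guards against membership cycles
        expanded.add(group)
        # A group may itself be a member of other groups, so keep walking upward.
        stack.extend(direct.get(group, ()))
    return expanded
```

For example, if `alice` is in `engineering`, and `engineering` is in `staff`, the expanded membership of `alice` contains both groups.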

Other

MS Exchange

Extracts content from Exchange servers, including mail (and attachments), calendars, and contacts
TBD

The REST connector can retrieve data from any JSON-based REST endpoint.