Available Connectors
Please check below the connectors available to Aspire 4.0 and 5.0.
Repository Type | Connector | Description |
---|
Released Version | Aspire 4.x | Aspire 5.x |
---|
File System |
---|
FileSystem | File System | Extracts documents from a locally accessible File System path | ✓ |
---|
5.0✓ |
SMB | Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol. | ✓ |
TBD✓ |
FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP) |
TBD
Image Removed | Extracts documents from S3 buckets in any region on AWS |
TBD✓ | ✓ |
Image Modified
|
Box | Extracts documents from Box.com |
TBD✓ | ☐ |
Image Modified
|
HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS | ✓ |
TBD☐ |
Image Modified OneDrive
| Extracts documents from Microsoft OneDrive accounts |
TBD
Image Removed | Extracts documents from Microsoft Azure Data Lake Store cloud |
TBD
Image Removed | Extracts documents from Microsoft Azure Blob Storage service | ✓ |
TBD
Image Removed | Extracts documents from Microsoft Azure File Storage service |
TBD✓ | ✓ |
Events, Messaging and Streaming |
---|
Image Removed | Extract events from Microsoft Azure Events Hub service |
TBD
Image Removed | Extracts events from Apache Kafka event streaming platform |
TBD
Image Removed | Extracts records from Amazon Kinesis Data Streams | ✓ |
TBD☐ |
RSS | Extracts items from RSS feeds |
TBD✓ | ☐ |
Relational Databases | RDB via Table | Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries. |
---|
TBD✓ | ✓ |
RDB via Snapshots | Extracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table | ✓ |
TBD✓ |
Image Modified Database Server
| Scans all databases within a server, extracts table information from all databases and |
extract extracts rows from all tables. |
TBD✓ | ✓ |
Content Management Systems | Image Modified
|
---|
Documentum | Extracts documents stored in docbases, cabinets, folders, and sub-folders within Documentum |
TBD✓ | ☐ |
Image Modified Documentum DQL
| Extracts documents using DQL query language for full and incremental crawls. ACLs extraction is also expressed as DQL statements. |
TBD
Image RemovedSharePoint 2010Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | | The Dropbox connector can crawl Pages, Folders and Files from a Dropbox repository. It does identity Crawling, can execute snapshot-based Incrementals and respects document hierarchy. | ☐ | ✓ |
Image Added
|
| Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments) |
TBD✓ | ✓ |
Image Modified SharePoint 2016
| Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments) |
TBD✓ | ✓ |
Image Modified SharePoint 2019
| Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments) |
TBD Image RemovedLotus Notes
| Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments) | TBD |
☐ | ✓ |
| Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments) |
5.0 | Collaboration | | Extracts documents from Confluence repositories, including |
---|
: spaces, blogs, pages, attachments, and comments |
TBD | ✓ | ✓ |
Image Added IBM
|
Image RemovedeRoom
| Extracts documents from an eRoom server instance (site) using the XML Query feature | TBD |
Image RemovedIBM | Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles, and Communities |
TBD✓ | ☐ |
Image Modified
|
Atlassian Image RemovedSocialcast
| Extracts content from any Socialcast Community server including messages, comments, attachments, conversations, polls, users, groups, streams likes, flags and badges | TBD |
Image RemovedTeamForge
| Extracts documents from TeamForge including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers and wiki services | TBD |
Image Removed | Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.) |
TBD | ✓ | ☐ |
Image Added
|
| Extracts content from Salesforce including Accounts, |
CampaingsCampaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments. |
TBD✓ | ✓ |
Image Modified ServiceNow
| Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users, and Catalog Items |
TBD Image RemovedSubversion
| Extracts files from a subversion instance by crawling the head of the repository | TBD |
Image Removed✓ |
| Extracts content from an Adobe Experience Manager (AEM) server, including all page and asset objects |
TBD |
CRM | Image Modified
|
---|
RightNow | Extracts content from a RightNow instance including Answers, Attachments, and Incidents |
TBDExtracts Extract pages and documents from |
web sites websites by following links inside HTML pages. Static |
web sites websites supported. Multiple Authentication mechanisms. |
TBDExtracts Extract pages and documents from |
web sites websites by following links inside HTML pages. Dynamic |
web sites websites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser. |
TBD✓ | ✓ |
Social Networks | Image Modified
|
---|
Jive | Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs, and any sub-folders. |
TBD✓ | ☐ |
Image Modified
|
Twitter | Extracts tweets and metadata from any |
twitter includes including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count |
TBD✓ | ☐ |
Image Modified
|
Yammer | Extracts content from Yammer messages by Group, Thread, and Topic. |
TBD✓ | ☐ |
NoSQL Database | Image Modified
|
---|
HBase | Extracts content stored in the objectData field of the tables in an HBase server. |
TBD
Image Removed | Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract. |
TBDGiven a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user. | 5.0 | The Group Expansion connector can crawl and expand identities from the Identity Cache. | ☐ | ✓ |
LDAP Identity |
LDAP | Retrieves users, groups, and memberships from any LDAP server |
TBD Identities | Retrieves users, groups, and memberships stored in Confluence repositories. |
TBD Active Directory | Retrieves users, groups, and memberships from Azure Active Directory. |
TBD✓ | ✓ |
Other | Image Modified MS Exchange
| Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact |
---|
TBD☐ | ☐ |
| The REST connector can retrieve data from any JSON-based REST endpoint. | ☐ | ✓ |