Please check below the connectors available to Aspire 4.0 and 5.0.
Repository Type | Connector | Description | Aspire 4.0 | Aspire 5.0 |
---|---|---|---|---|
File System | FileSystem | Extracts documents from a locally accessible File System path | ✓ | ✓ |
SMB | Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol. | ✓ | ☐Late November 2021 | |
FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP) | ✓ | ☐ | |
Amazon Amazon S3 | Extracts documents from S3 buckets in any region on AWS | ✓ | ☐ | |
Box Box.com | Extracts documents from Box.com | ✓ | ☐ | |
HDFS HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS | ✓ | ☐ | |
OneDrive OneDrive | Extracts documents from Microsoft OneDrive accounts | ✓ | ☐ | |
Azure Azure Data Lake | Extracts documents from Microsoft Azure Data Lake Store cloud | ✓ | ☐ | |
Azure Azure Blob Storage | Extracts documents from Microsoft Azure Blob Storage service | ✓ | ☐ | |
Azure Azure File Storage | Extracts documents from Microsoft Azure File Storage service | ✓ | ☐ | |
Events, Messaging and Streaming | Azure Azure Events Hub | Extract events from Microsoft Azure Events Hub service | ✓ | ☐ |
Apache Apache Kafka | Extracts events from Apache Kafka event streaming platform | ✓ | ☐ | |
Amazon Amazon Kinesis | Extracts records from Amazon Kinesis Data Streams | ✓ | ☐ | |
RSS | Extracts items from RSS feeds | ✓ | ☐ | |
Relational Databases | RDB via Table | Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries. | ✓ | ☐ Late November 2021 |
RDB via Snapshots | Extracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table | ✓ | ☐ Late November 2021 | |
Database Database Server | Scans all databases within a server, extracts table information from all databases and extract rows from all tables. | ✓ | ☐ | |
Content Management Systems | Documentum Documentum | Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum | ✓ | ☐ |
Documentum Documentum DQL | Extracts documents using DQL query language for full and incremental crawls. ACLs extraction also expressed as DQL statements. | ✓ | ☐ | |
SharePoint SharePoint 2013 | Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments) | ✓ | ☐ | |
SharePoint SharePoint 2016 | Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments) | ✓ | ☐ | |
SharePoint SharePoint 2019 | Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments) | ☐ | ☐ | |
Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments) | ✓ | ✓ | ||
Collaboration | Atlassian Atlassian Confluence | Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and comments | ✓ | ☐ |
IBM IBM Connections | Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities | ✓ | ☐ | |
Atlassian Atlassian Jira | Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc) | ✓ | ☐ | |
Salesforce Salesforce | Extracts content from Salesforce including Accounts, Campaings, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments. | ✓ | ☐ | |
ServiceNow ServiceNow | Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items | ✓ | ☐ | |
Adobe Adobe Experience Manager (AEM) | Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objects | ✓ | ☐ | |
CRM | RightNow RightNow | Extracts content from a RightNow instance including Answers, Attachments and Incidents | ✓ | ☐ |
Web Crawler | Aspider Web Crawler | Extracts Extract pages and documents from web sites websites by following links inside HTML pages. Static web sites websites supported. Multiple Authentication mechanisms | ✓ | ☐ Late November 2021 |
Selenium Selenium Crawler | Extracts Extract pages and documents from web sites websites by following links inside HTML pages. Dynamic web sites websites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser. | ✓ | ☐ Late November 2021 | |
Social Networks | Jive Jive | Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders. | ✓ | ☐ |
Twitter Twitter | Extracts tweets and metadata from any twitter account, includes Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count | ✓ | ☐ | |
Yammer Yammer | Extracts content from Yammer messages by Group, Thread and Topic. | ✓ | ☐ | |
NoSQL Database | HBase HBase | Extracts content stored in the objectData field of the tables in an HBase server. | ✓ | ☐ |
Elasticsearch Elasticsearch | Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract. | ✓ | ☐ | |
Identity Providers | Group Expansion | Given a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user. | ✓ | ✓ |
LDAP | Retrieves users, groups and memberships from any LDAP server | ✓ | ☐ | |
Atlassian Atlassian Confluence Identities | Retrieves users, groups and memberships stored in Confluence repositories. | ✓ | ☐ | |
Azure Azure Active Directory | Retrieves users, groups and memberships from Azure Active Directory. | ✓ | ☐ | |
Other | MS Exchange | Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact | ☐ | ☐ |