This is the list of connectors available to Aspire 4.0 and 5.0.
Repository Type | Connector | Description | Aspire 4.0 | Aspire 5.0 |
---|---|---|---|---|
File System | FileSystem | Extracts documents from a locally accessible File System path | TBD | Yes |
SMB | Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol. | TBD | TBD | |
FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP) | TBD | TBD | |
Amazon S3 | Extracts documents from S3 buckets in any region on AWS | TBD | TBD | |
Box | Extracts documents from Box.com | TBD | TBD | |
HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS | TBD | TBD | |
OneDrive | Extracts documents from Microsoft OneDrive accounts | TBD | TBD | |
Azure Data Lake | Extracts documents from Microsoft Azure Data Lake Store cloud | TBD | TBD | |
Azure Blob Storage | Extracts documents from Microsoft Azure Blob Storage service | TBD | TBD | |
Azure File Storage | Extracts documents from Microsoft Azure File Storage service | TBD | TBD | |
Events, Messaging and Streaming | Azure Events Hub | Extract events from Microsoft Azure Events Hub service | TBD | TBD |
Apache Kafka | Extracts events from Apache Kafka event streaming platform | TBD | TBD | |
Amazon Kinesis | Extracts records from Amazon Kinesis Data Streams | TBD | TBD | |
RSS | Extracts items from RSS feeds | TBD | TBD | |
Relational Databases | RDB via Table | Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries. | TBD | TBD |
RDB via Snapshots | Extracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table | TBD | TBD | |
Database Server | Scans all databases within a server, extracts table information from all databases and extract rows from all tables. | TBD | TBD | |
Content Management Systems | Documentum | Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum | TBD | TBD |
Documentum DQL | Extracts documents using DQL query language for full and incremental crawls. ACLs extraction also expressed as DQL statements. | TBD | TBD | |
SharePoint 2010 | Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | TBD | |
SharePoint 2013 | Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | TBD | |
SharePoint 2016 | Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | TBD | |
SharePoint 2019 | Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | TBD | |
Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments) | TBD | Yes | ||
Lotus Notes | Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments) | TBD | TBD | |
Collaboration | Atlassian Confluence | Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and comments | TBD | TBD |
eRoom | Extracts documents from an eRoom server instance (site) using the XML Query feature | TBD | TBD | |
IBM Connections | Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities | TBD | TBD | |
Atlassian Jira | Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc) | TBD | TBD | |
Socialcast | Extracts content from any Socialcast Community server including messages, comments, attachments, conversations, polls, users, groups, streams likes, flags and badges | TBD | TBD | |
TeamForge | Extracts documents from TeamForge including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers and wiki services | TBD | TBD | |
Salesforce | Extracts content from Salesforce including Accounts, Campaings, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments. | TBD | TBD | |
ServiceNow | Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items | TBD | TBD | |
Subversion | Extracts files from a subversion instance by crawling the head of the repository | TBD | TBD | |
Adobe Experience Manager (AEM) | Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objects | TBD | TBD | |
CRM | RightNow | Extracts content from a RightNow instance including Answers, Attachments and Incidents | TBD | TBD |
Web Crawler | Aspider | Extracts pages and documents from web sites by following links inside HTML pages. Static web sites supported. Multiple Authentication mechanisms | TBD | TBD |
Selenium Crawler | Extracts pages and documents from web sites by following links inside HTML pages. Dynamic web sites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser. | TBD | TBD | |
Social Networks | Jive | Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders. | TBD | TBD |
Extracts tweets and metadata from any twitter account, includes Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count | TBD | TBD | ||
Yammer | Extracts content from Yammer messages by Group, Thread and Topic. | TBD | TBD | |
NoSQL Database | HBase | Extracts content stored in the objectData field of the tables in an HBase server. | TBD | TBD |
Elasticsearch | Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract. | TBD | TBD | |
Identity Providers | Group Expansion | Given a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user. | TBD | Yes |
LDAP | Retrieves users, groups and memberships from any LDAP server | TBD | TBD | |
Atlassian Confluence Identities | Retrieves users, groups and memberships stored in Confluence repositories. | TBD | TBD | |
Azure Active Directory | Retrieves users, groups and memberships from Azure Active Directory. | TBD | TBD | |
Other | Exchange | Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact | TBD | TBD |