The table below lists the connectors available for Aspire 4.0 and 5.0.
| Repository Type | Connector | Description | Aspire 4.0 | Aspire 5.0 |
|---|---|---|---|---|
| File System | File System | Extracts documents from a locally accessible file system path | ✓ | ✓ |
| | SMB | Extracts documents from remote file shares using the Server Message Block (SMB) protocol | ✓ | ✓ |
| | FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP) | ✓ | ☐ |
| | | Extracts documents from S3 buckets in any region on AWS | ✓ | ✓ |
| | Box.com | Extracts documents from Box.com | ✓ | ☐ |
| | HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS | ✓ | ☐ |
| | | Extracts documents from Microsoft OneDrive accounts | ✓ | ✓ |
| | | Extracts documents from the Microsoft Azure Data Lake Store cloud service | ✓ | ✓ |
| | | Extracts documents from the Microsoft Azure Blob Storage service | ✓ | ✓ |
| | | Extracts documents from the Microsoft Azure File Storage service | ✓ | ✓ |
| Events, Messaging and Streaming | | Extracts events from the Microsoft Azure Event Hubs service | ✓ | ✓ |
| | | Extracts events from the Apache Kafka event streaming platform | ✓ | ✓ |
| | | Extracts records from Amazon Kinesis Data Streams | ✓ | ☐ |
| | RSS | Extracts items from RSS feeds | ✓ | ☐ |
| Relational Databases | RDB via Table | Extracts content from relational databases using SQL queries, and performs incremental updates based on Update-Table queries | ✓ | ✓ |
| | RDB via Snapshots | Extracts content from relational databases using SQL queries, and performs incremental updates using a content-digest snapshot table | ✓ | ✓ |
| | | Scans all databases within a server, extracts table information from all databases, and extracts rows from all tables | ✓ | ✓ |
| Content Management Systems | Documentum | Extracts documents stored in docbases, cabinets, folders, and sub-folders within Documentum | ✓ | ☐ |
| | | Extracts documents using the DQL query language for full and incremental crawls. ACL extraction is also expressed as DQL statements. | ✓ | ✓ |
| | | The Dropbox connector can crawl pages, folders, and files from a Dropbox repository. It performs identity crawling, supports snapshot-based incremental crawls, and respects the document hierarchy. | ☐ | ✓ |
| | | Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | TBD |
| | | Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments) | ✓ | ✓ |
| | | Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments) | ✓ | ✓ |
| | | Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments) | ☐ | ✓ |
| | | Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments) | TBD | ✓ |
| | Lotus Notes | Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments) | ✓ | ✓ |
| Collaboration | | Extracts documents from Confluence repositories, including spaces, blogs, pages, attachments, and comments | ✓ | TBD |
| | eRoom | Extracts documents from an eRoom server instance (site) using the XML Query feature | ✓ | TBD |
| | IBM Connections | Extracts content from IBM Connections servers, including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles, and Communities | ✓ | ☐ |
| | Atlassian Jira | Extracts content from different Jira issue types (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.) | ✓ | TBD |
| | Socialcast | Extracts content from any Socialcast Community server, including messages, comments, attachments, conversations, polls, users, groups, streams, likes, flags, and badges | TBD | TBD |
| | TeamForge | Extracts documents from TeamForge, including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers, and wiki services | ☐ | TBD |
| | | Extracts content from Salesforce, including Accounts, Campaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles, and Attachments | ✓ | ✓ |
| | | Extracts content from ServiceNow, including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users, and Catalog Items | ✓ | ✓ |
| | | Extracts files from a Subversion instance by crawling the head of the repository | TBD | TBD |
| | | Extracts content from an Adobe Experience Manager (AEM) server, including all page and asset objects | ✓ | ✓ |
| | | Extracts content from Veeva Vault using a Vault Query Language (VQL) statement | ☐ | ✓ |
| | Kinesis | Fetches data from Amazon Kinesis Data Streams | ✓ | ☐ |
| CRM | RightNow | Extracts content from a RightNow instance, including Answers, Attachments, and Incidents | ✓ | ☐ |
| Web Crawler | | Extracts pages and documents from websites by following links inside HTML pages. Supports static websites and multiple authentication mechanisms. | ✓ | ✓ |
| | | Extracts pages and documents from websites by following links inside HTML pages. Supports dynamic websites. Uses the Selenium framework to render pages in real browser instances, and allows highly flexible crawling by scripting behaviors in the browser. | ✓ | ✓ |
| Social Networks | Jive | Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs, and any sub-folders. | ✓ | ☐ |
| | | Extracts tweets and metadata from any Twitter account, including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, and Retweet count | ✓ | ☐ |
| | Yammer | Extracts content from Yammer messages by Group, Thread, and Topic | ✓ | ☐ |
| NoSQL Database | HBase | Extracts content stored in the objectData field of the tables in an HBase server | ✓ | ☐ |
| | | Extracts documents stored in an Elasticsearch index, using a query to filter the documents to extract | ✓ | ✓ |
| Identity Providers | Group Expansion | Given a list of users with group memberships, recursively expands the group membership information to compute the complete list of group memberships for any given user. The Group Expansion connector can crawl and expand identities from the Identity Cache. | ☐ | ✓ |
| | LDAP Identity | Retrieves users, groups, and memberships from any LDAP server | ✓ | ✓ |
| | Identities | Retrieves users, groups, and memberships stored in Confluence repositories | ✓ | ✓ |
| | Active Directory | Retrieves users, groups, and memberships from Azure Active Directory | ✓ | ✓ |
| Other | MS Exchange | Extracts content from Exchange servers, including Mail (and attachments), Calendar, and Contact | ☐ | ☐ |
| | | The REST connector can retrieve data from any JSON-based REST endpoint | ☐ | ✓ |
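
The Group Expansion entry describes computing the complete set of group memberships for a user by following nested groups. As a rough illustration of that idea only, the sketch below walks a group hierarchy with a worklist instead of literal recursion, so that membership cycles cannot loop forever. The function name `expand_groups` and the `parent_groups` mapping are hypothetical; they are not part of Aspire's API or data model.

```python
from collections import deque


def expand_groups(direct_groups, parent_groups):
    """Return every group a user belongs to, directly or through nesting.

    direct_groups: groups the user is an immediate member of.
    parent_groups: dict mapping a group to the groups that contain it.
    Both structures are assumptions made for this example.
    """
    seen = set(direct_groups)
    queue = deque(direct_groups)
    while queue:
        group = queue.popleft()
        for parent in parent_groups.get(group, ()):
            if parent not in seen:  # skip groups already visited (handles cycles)
                seen.add(parent)
                queue.append(parent)
    return seen


# Hypothetical hierarchy with a deliberate cycle between "staff" and "everyone".
parent_groups = {
    "engineering": ["staff"],
    "staff": ["everyone"],
    "everyone": ["staff"],
}
print(sorted(expand_groups(["engineering"], parent_groups)))
# prints ['engineering', 'everyone', 'staff']
```

Tracking visited groups in a set keeps the expansion linear in the number of membership links, which matters when directories contain deeply nested or circular groups.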