Available Connectors
Please check below the connectors available to Aspire 4.0 and 5.0.
Repository Type | Connector | Description |
---|
Released Version | Aspire 4.x | Aspire 5.x |
---|
File System |
---|
FileSystem | File System | Extracts documents from a locally accessible File System path | ✓ |
---|
5.0✓ |
SMB | Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol. | ✓ |
TBD✓ |
FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP) | ✓ |
TBDImage Removed | Extracts documents from S3 buckets in any region on AWS | ✓ |
TBD✓ |
Image Modified |
Box | Extracts documents from Box.com | ✓ |
TBD☐ |
Image Modified |
HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS | ✓ |
TBD☐ |
Image Modified OneDrive | Extracts documents from Microsoft OneDrive accounts | ✓ |
TBDImage Removed | Extracts documents from Microsoft Azure Data Lake Store cloud | ✓ |
TBDImage Removed | Extracts documents from Microsoft Azure Blob Storage service | ✓ |
TBDImage Removed | Extracts documents from Microsoft Azure File Storage service | ✓ |
TBD✓ |
Events, Messaging and Streaming |
---|
Image Removed | Extract events from Microsoft Azure Events Hub service | ✓ |
TBDImage Removed | Extracts events from Apache Kafka event streaming platform | ✓ |
TBDImage Removed | Extracts records from Amazon Kinesis Data Streams |
TBD✓ | ☐ |
RSS | Extracts items from RSS feeds |
TBD✓ | ☐ |
Relational Databases | RDB via Table | Extracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries. |
---|
TBD✓ | ✓ |
RDB via Snapshots | Extracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table | ✓ |
TBD✓ |
Image Modified Database Server | Scans all databases within a server, extracts table information from all databases and |
extract extracts rows from all tables. | ✓ |
TBD✓ |
Content Management Systems | Image Modified |
---|
Documentum | Extracts documents stored in docbases, cabinets, folders, and sub-folders within Documentum | ✓ |
TBD☐ |
Image Modified Documentum DQL | Extracts documents using DQL query language for full and incremental crawls. ACLs extraction is also expressed as DQL statements. | ✓ |
TBDImage RemovedSharePoint 2010Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments) | TBD | | The Dropbox connector can crawl Pages, Folders and Files from a Dropbox repository. It does identity Crawling, can execute snapshot-based Incrementals and respects document hierarchy. | ☐ | ✓ |
Image Added |
| Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments) |
TBD✓ | ✓ |
Image Modified SharePoint 2016 | Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments) |
TBD✓ | ✓ |
Image Modified SharePoint 2019 | Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments) | ☐ |
TBDImage RemovedLotus Notes | Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments) | TBD |
✓ |
| Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments) |
5.0 | Collaboration | | Extracts documents from Confluence repositories, including |
---|
: spaces, blogs, pages, attachments, and comments |
TBD | ✓ | ✓ |
Image Added IBM |
Image RemovedeRoom | Extracts documents from an eRoom server instance (site) using the XML Query feature | TBD |
Image RemovedIBM | Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles, and Communities | ✓ |
TBD☐ |
Image Modified |
Atlassian | Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.) |
TBDImage RemovedSocialcast | Extracts content from any Socialcast Community server including messages, comments, attachments, conversations, polls, users, groups, streams likes, flags and badges | TBD |
Image RemovedTeamForge | Extracts documents from TeamForge including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers and wiki services | TBD |
Image Removed☐ |
Image Added Salesforce | Extracts content from Salesforce including Accounts, |
CampaingsCampaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments. | ✓ |
TBD✓ |
Image Modified ServiceNow | Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users, and Catalog Items | ✓ |
TBD✓ |
Image Modified |
SubversionTBD | Image Removed Adobe Experience Manager (AEM) server, including all page and asset objects | ✓ | ✓ |
Image Added Veeva Vault | Extracts content from Veeva Vault using a Vault Query Language (VQL) |
TBDTBDinstance including Answers, Attachments, and Incidents | ✓ | ☐ |
Web Crawler | |
---|
| Extract pages and documents from websites by following links inside HTML pages. Static websites supported. Multiple Authentication mechanisms. | ✓ | ✓ |
| Extract pages and documents from websites by following links inside HTML pages. Dynamic websites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser. | ✓ | ✓ |
TBD | Image RemovedSelenium Crawler | TBD |
Social Networks | Image Modified Jive | Extracts content from any Jive |
---|
TBD | Image RemovedTwitter | TBD | Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs, and any sub-folders. | ✓ | ☐ |
Image Added Twitter | Extracts tweets and metadata from any Twitter account, including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count | ✓ | ☐ |
Image Added Yammer | Extracts content from Yammer messages by Group, Thread, and Topic. | ✓ | ☐ |
Image RemovedYammer | TBD |
NoSQL Database | Image Modified HBase | Extracts content stored in the objectData field of the tables in an HBase |
---|
TBD | Image RemovedElasticsearch | server. | ✓ | ☐ |
| Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract. | ✓ | ✓ |
TBD5.0 | The Group Expansion connector can crawl and expand identities from the Identity Cache. | ☐ | ✓ |
LDAP Identity | Retrieves users, groups, and memberships from any LDAP server | ✓ | ✓ |
| Retrieves users, groups, and memberships stored in Confluence repositories. | ✓ | ✓ |
| Retrieves users, groups, and memberships from Azure Active Directory. | ✓ | ✓ |
Other | Image Added MS Exchange | Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact | ☐ | ☐ |
---|
| The REST connector can retrieve data from any JSON-based REST endpoint. | ☐ | ✓ |
LDAP | TBD | Image RemovedAtlassian Confluence Identities | TBD | Image RemovedAzure Active Directory | TBD | Other | Image RemovedExchange | TBD