Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Available Connectors

Please check below the connectors available to Aspire 4.0 and 5.0.

Repository TypeConnectorDescription

Aspire 

4.0

Aspire 

5.0

File System








FileSystemExtracts documents from a locally accessible File System path

SMBExtracts documents from remote sharing servers using the Server Message Block (SMB) protocol.

Late November 2021

FTPExtracts documents from remote servers using the File Transfer Protocol (FTP)

Amazon  Amazon S3

Extracts documents from S3 buckets in any region on AWS

Box Box.com

Extracts documents from Box.com

HDFS HDFS

Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS

OneDrive OneDrive

Extracts documents from Microsoft OneDrive accounts

Azure  Azure Data Lake

Extracts documents from Microsoft Azure Data Lake Store cloud

Azure  Azure Blob Storage

Extracts documents from Microsoft Azure Blob Storage service

Azure  Azure File Storage

Extracts documents from Microsoft Azure File Storage service

Events, Messaging and Streaming

Azure  Azure Events Hub

Extract events from Microsoft Azure Events Hub service

Apache  Apache Kafka

Extracts events from Apache Kafka event streaming platform

Amazon  Amazon Kinesis

Extracts records from Amazon Kinesis Data Streams

RSSExtracts items from RSS feeds

Relational DatabasesRDB via TableExtracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries.

Late November 2021

RDB via SnapshotsExtracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table

Late November 2021

Database  Database Server

Scans all databases within a server, extracts table information from all databases and extract rows from all tables.

Content Management Systems

Documentum Documentum

Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum

Documentum  Documentum DQL

Extracts documents using DQL query language for full and incremental crawls. ACLs extraction also expressed as DQL statements.

SharePoint  SharePoint 2013

Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments)

SharePoint  SharePoint 2016

Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments)

SharePoint  SharePoint 2019

Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments)

Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments)

Collaboration

Atlassian  Atlassian Confluence

Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and comments

IBM  IBM Connections

Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities

Atlassian  Atlassian Jira

Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc)

Salesforce Salesforce

Extracts content from Salesforce including Accounts, Campaings, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments.

ServiceNow ServiceNow

Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items

Adobe  Adobe Experience Manager (AEM)

Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objects

CRM

RightNow RightNow

Extracts content from a RightNow instance including Answers, Attachments and Incidents

Web Crawler


Aspider Web Crawler

Extracts Extract pages and documents from web sites websites by following links inside HTML pages. Static web sites websites supported. Multiple Authentication mechanisms

Late November 2021

Selenium  Selenium Crawler

Extracts Extract pages and documents from web sites websites by following links inside HTML pages. Dynamic web sites websites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser.

Late November 2021

Social Networks

Jive Jive

Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders.

Twitter Twitter

Extracts tweets and metadata from any twitter account, includes Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count

Yammer Yammer

Extracts content from Yammer messages by Group, Thread and Topic.

NoSQL Database

HBase HBase

Extracts content stored in the objectData field of the tables in an HBase server.

Elasticsearch Elasticsearch

Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract.

Identity ProvidersGroup ExpansionGiven a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user.

LDAPRetrieves users, groups and memberships from any LDAP server

Atlassian  Atlassian Confluence Identities

Retrieves users, groups and memberships stored in Confluence repositories.

Azure  Azure Active Directory

Retrieves users, groups and memberships from Azure Active Directory.

Other

 MS Exchange

Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact