You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 21 Next »

Available Connectors

Please check below the connectors available to Aspire 4.0 and 5.0.

Repository TypeConnectorDescription

Aspire 

4.0

Aspire 

5.0

File System








File SystemExtracts documents from a locally accessible File System path

SMBExtracts documents from remote sharing servers using the Server Message Block (SMB) protocol.

FTPExtracts documents from remote servers using the File Transfer Protocol (FTP)

Extracts documents from S3 buckets in any region on AWS

 Box.com

Extracts documents from Box.com

 HDFS

Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS

 OneDrive

Extracts documents from Microsoft OneDrive accounts

 Azure Data Lake

Extracts documents from Microsoft Azure Data Lake Store cloud

 Azure Blob Storage

Extracts documents from Microsoft Azure Blob Storage service

 Azure File Storage

Extracts documents from Microsoft Azure File Storage service

Events, Messaging and Streaming

 Azure Events Hub

Extract events from Microsoft Azure Events Hub service

 Apache Kafka

Extracts events from Apache Kafka event streaming platform

 Amazon Kinesis

Extracts records from Amazon Kinesis Data Streams

RSSExtracts items from RSS feeds

Relational Databases

RDB via TableExtracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries.

RDB via SnapshotsExtracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot table

 Database Server

Scans all databases within a server, extracts table information from all databases and extract rows from all tables.

February 2022

Content Management Systems

 Documentum

Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum

 Documentum DQL

Extracts documents using DQL query language for full and incremental crawls. ACLs extraction also expressed as DQL statements.

 SharePoint 2013

Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments)

 SharePoint 2016

Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments)

 SharePoint 2019

Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments)

Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments)

Collaboration

 Atlassian Confluence

Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and comments

 IBM Connections

Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities

 Atlassian Jira

Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc)

 Salesforce

Extracts content from Salesforce including Accounts, Campaings, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments.

Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items

 Adobe Experience Manager (AEM)

Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objects

CRM

 RightNow

Extracts content from a RightNow instance including Answers, Attachments and Incidents

Web Crawler

Aspider Web Crawler

Extract pages and documents from websites by following links inside HTML pages. Static websites supported. Multiple Authentication mechanisms

February 2022

 Selenium Crawler

Extract pages and documents from websites by following links inside HTML pages. Dynamic websites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser.

February 2022

Social Networks

 Jive

Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders.

 Twitter

Extracts tweets and metadata from any twitter account, includes Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet count

 Yammer

Extracts content from Yammer messages by Group, Thread and Topic.

NoSQL Database

 HBase

Extracts content stored in the objectData field of the tables in an HBase server.

 Elasticsearch

Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract.

February 2022

Identity Providers

Group ExpansionGiven a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user.

LDAPRetrieves users, groups and memberships from any LDAP server

 Atlassian Confluence Identities

Retrieves users, groups and memberships stored in Confluence repositories.

Retrieves users, groups and memberships from Azure Active Directory.

Other

 MS Exchange

Extracts content from the Exchange Servers including Mail (and attachments), Calendar and Contact


  • No labels