Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Repository TypeConnectorDescription

Aspire 

4.0

Aspire 

5.0

File System








FileSystemExtracts documents from a locally accessible File System pathTBD

Yes

SMBExtracts documents from remote sharing servers using the Server Message Block (SMB) protocol.TBD

TBD

FTPExtracts documents from remote servers using the File Transfer Protocol (FTP)TBD

TBD

Amazon S3

Extracts documents from S3 buckets in any region on AWSTBD

TBD

Box.com

Extracts documents from Box.comTBD

TBD

HDFS

Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFSTBD

TBD

OneDrive

Extracts documents from Microsoft OneDrive accountsTBD

TBD

Azure Data Lake

Extracts documents from Microsoft Azure Data Lake Store cloudTBD

TBD

Azure Blob Storage

Extracts documents from Microsoft Azure Blob Storage serviceTBD

TBD

Azure File Storage

Extracts documents from Microsoft Azure File Storage serviceTBD

TBD

Events, Messaging and Streaming

Azure Events Hub

Extract events from Microsoft Azure Events Hub serviceTBD

TBD

Apache Kafka

Extracts events from Apache Kafka event streaming platformTBD

TBD

Amazon Kinesis

Extracts records from Amazon Kinesis Data StreamsTBD

TBD

RSSExtracts items from RSS feedsTBD

TBD

Relational DatabasesRDB via TableExtracts content from Relational Database SQL queries, and performs incremental updates based on Update-Table queries.TBD

TBD

RDB via SnapshotsExtracts content from Relational Database SQL queries, and performs incremental update by using a content digest Snapshot tableTBD

TBD

Database Server

Scans all databases within a server, extracts table information from all databases and extract rows from all tables.TBD

TBD

Content Management Systems

Documentum

Extracts documents stored in docbases, cabinets, folders and sub-folders within DocumentumTBD

TBD

Documentum DQL

Extracts documents using DQL query language for full and incremental crawls. ACLs extraction also expressed as DQL statements.TBD

TBD

SharePoint 2010

Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments)TBDTBD

Image RemovedSharePoint 2013

Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments)TBD

TBD

SharePoint 2016

Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments)TBD

TBD

SharePoint 2019

Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments)TBD

TBD

Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments)TBD

Yes

Image RemovedLotus Notes

Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments)TBDTBD

Collaboration

Atlassian Confluence

Extracts documents from Confluence repositories, including: spaces, blogs, pages, attachments and commentsTBDTBD

Image RemovedeRoom

Extracts documents from an eRoom server instance (site) using the XML Query featureTBDTBD

IBM Connections

Extracts content from IBM Connections servers including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and CommunitiesTBD

TBD

Atlassian Jira

Extracts content from different Jira issue types: (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc)TBDTBD

Image RemovedSocialcast

Extracts content from any Socialcast Community server including messages, comments, attachments, conversations, polls, users, groups, streams likes, flags and badgesTBDTBD

Image RemovedTeamForge

Extracts documents from TeamForge including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers and wiki servicesTBDTBD

Salesforce

Extracts content from Salesforce including Accounts, Campaings, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments.TBD

TBD

ServiceNow

Extracts content from ServiceNow including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog ItemsTBDTBD

Image RemovedSubversion

Extracts files from a subversion instance by crawling the head of the repositoryTBDTBD

Adobe Experience Manager (AEM)

Extracts content from an Adobe Experience Manager (AEM) server including all page and asset objectsTBD

TBD

CRM

RightNow

Extracts content from a RightNow instance including Answers, Attachments and IncidentsTBD

TBD

Web Crawler


Aspider Web Crawler

Extracts pages and documents from web sites by following links inside HTML pages. Static web sites supported. Multiple Authentication mechanismsTBD

TBD

Selenium Crawler

Extracts pages and documents from web sites by following links inside HTML pages. Dynamic web sites supported. Uses the Selenium framework to render the pages in real browser instances. Highly flexible crawling by scripting behaviors on the browser.TBD

TBD

Social Networks

Jive

Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders.TBD

TBD

Twitter

Extracts tweets and metadata from any twitter account, includes Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities, Retweet countTBD

TBD

Yammer

Extracts content from Yammer messages by Group, Thread and Topic.TBD

TBD

NoSQL Database

HBase

Extracts content stored in the objectData field of the tables in an HBase server.TBD

TBD

Elasticsearch

Extracts documents stored in an Elasticsearch index using a query to filter the documents to extract.TBD

TBD

Identity ProvidersGroup ExpansionGiven a list of users with group memberships, recursively expand the group membership information to compute the complete list of group memberships for any given user.TBD

Yes

LDAPRetrieves users, groups and memberships from any LDAP serverTBD

TBD

Atlassian Confluence Identities

Retrieves users, groups and memberships stored in Confluence repositories.TBD

TBD

Azure Active Directory

Retrieves users, groups and memberships from Azure Active Directory.TBD

TBD

Other

Exchange

Extracts content from the Exchange Servers including Mail (and attachments), Calendar and ContactTBD

TBD