| Repository Type | Connector | Description | Released Version: Aspire 4.0 | Released Version: Aspire 5.0 |
| --- | --- | --- | --- | --- |
| File System | FileSystem | Extracts documents from a locally accessible file system path. | TBD | 5.0, Yes |
| File System | SMB | Extracts documents from remote sharing servers using the Server Message Block (SMB) protocol. | TBD | TBD |
| File System | FTP | Extracts documents from remote servers using the File Transfer Protocol (FTP). | TBD | TBD |
| File System | Amazon S3 | Extracts documents from S3 buckets in any region on AWS. | TBD | TBD |
| File System | Box | Extracts documents from Box.com. | TBD | TBD |
| File System | HDFS | Extracts documents from the Hadoop Distributed File System (HDFS) via WebHDFS. | TBD | TBD |
| File System | OneDrive | Extracts documents from Microsoft OneDrive accounts. | TBD | TBD |
| File System | Azure Data Lake | Extracts documents from the Microsoft Azure Data Lake Store cloud. | TBD | TBD |
| File System | Azure Blob Storage | Extracts documents from the Microsoft Azure Blob Storage service. | TBD | TBD |
| File System | Azure File Storage | Extracts documents from the Microsoft Azure File Storage service. | TBD | TBD |
| Events, Messaging and Streaming | Azure Events Hub | Extracts events from the Microsoft Azure Events Hub service. | TBD | TBD |
| Events, Messaging and Streaming | Apache Kafka | Extracts events from the Apache Kafka event streaming platform. | TBD | TBD |
| Events, Messaging and Streaming | Amazon Kinesis | Extracts records from Amazon Kinesis Data Streams. | TBD | TBD |
| Events, Messaging and Streaming | RSS | Extracts items from RSS feeds. | TBD | TBD |
| Relational Databases | RDB via Table | Extracts content from relational databases using SQL queries, and performs incremental updates based on Update-Table queries. | TBD | TBD |
| Relational Databases | RDB via Snapshots | Extracts content from relational databases using SQL queries, and performs incremental updates by using a content digest Snapshot table. | TBD | TBD |
| Relational Databases | Database Server | Scans all databases within a server, extracts table information from all databases and extracts rows from all tables. | TBD | TBD |
| Content Management Systems | Documentum | Extracts documents stored in docbases, cabinets, folders and sub-folders within Documentum. | TBD | TBD |
| Content Management Systems | Documentum DQL | Extracts documents using the DQL query language for full and incremental crawls. ACL extraction is also expressed as DQL statements. | TBD | TBD |
| Content Management Systems | SharePoint 2010 | Extracts documents from Microsoft SharePoint 2010 (sites, lists, external lists, folders, documents or list items, attachments). | TBD | TBD |
| Content Management Systems | SharePoint 2013 | Extracts documents from Microsoft SharePoint 2013 (sites, lists, external lists, folders, documents or list items, attachments). | TBD | TBD |
| Content Management Systems | SharePoint 2016 | Extracts documents from Microsoft SharePoint 2016 (sites, lists, external lists, folders, documents or list items, attachments). | TBD | TBD |
| Content Management Systems | SharePoint 2019 | Extracts documents from Microsoft SharePoint 2019 (sites, lists, external lists, folders, documents or list items, attachments). | TBD | TBD |
| Content Management Systems | SharePoint Online | Extracts documents from Microsoft SharePoint Online (sites, lists, external lists, folders, documents or list items, attachments). | TBD | 5.0, Yes |
| Content Management Systems | Lotus Notes | Extracts documents from Lotus Notes repositories (Application and Mail Databases, Knowledge Base Documents, Mail and Attachments). | TBD | TBD |
| Collaboration | Atlassian Confluence | Extracts documents from Confluence repositories, including spaces, blogs, pages, attachments and comments. | TBD | TBD |
| Collaboration | eRoom | Extracts documents from an eRoom server instance (site) using the XML Query feature. | TBD | TBD |
| Collaboration | IBM Connections | Extracts content from IBM Connections servers, including Activities, Blogs, Bookmarks, Files, Forums, Wikis, Profiles and Communities. | TBD | TBD |
| Collaboration | Atlassian Jira | Extracts content from different Jira issue types (Bug, CCB, Device Profile, Epic, Improvement, Information, Inquiry, New Feature, Question, etc.). | TBD | TBD |
| Collaboration | Socialcast | Extracts content from any Socialcast Community server, including messages, comments, attachments, conversations, polls, users, groups, streams, likes, flags and badges. | TBD | TBD |
| Collaboration | TeamForge | Extracts documents from TeamForge, including projects, discussions, documents, releases, news, project pages, planning folders, repositories, tasks, trackers and wiki services. | TBD | TBD |
| Collaboration | Salesforce | Extracts content from Salesforce, including Accounts, Campaigns, Cases, Contracts, Contacts, Chatters, Documents, Groups, Ideas, Leads, Opportunities, Partners, Pricebooks, Products, Profiles, Solutions, Tasks, User, Knowledge Articles and Attachments. | TBD | TBD |
| Collaboration | ServiceNow | Extracts content from ServiceNow, including Knowledge Articles, Article Categories, Knowledge Bases, Attachments, ACLs, Users and Catalog Items. | TBD | TBD |
| Collaboration | Subversion | Extracts files from a Subversion instance by crawling the head of the repository. | TBD | TBD |
| Collaboration | Adobe Experience Manager (AEM) | Extracts content from an Adobe Experience Manager (AEM) server, including all page and asset objects. | TBD | TBD |
| CRM | RightNow | Extracts content from a RightNow instance, including Answers, Attachments and Incidents. | TBD | TBD |
| Web Crawler | Aspider | Extracts pages and documents from web sites by following links inside HTML pages. Supports static web sites and multiple authentication mechanisms. | TBD | TBD |
| Web Crawler | Selenium Crawler | Extracts pages and documents from web sites by following links inside HTML pages. Supports dynamic web sites. Uses the Selenium framework to render the pages in real browser instances. Browser behaviors can be scripted for highly flexible crawling. | TBD | TBD |
| Social Networks | Jive | Extracts content from any Jive Community using REST API v3. Includes documents stored in spaces, groups, projects, blogs and any sub-folders. | TBD | TBD |
| Social Networks | Twitter | Extracts tweets and metadata from any Twitter account, including Tweet Text, URL Links, Geo Location, Hashtags, User mentions, Media entities and Retweet count. | TBD | TBD |
| Social Networks | Yammer | Extracts content from Yammer messages by Group, Thread and Topic. | TBD | TBD |
| NoSQL Database | HBase | Extracts content stored in the objectData field of the tables in an HBase server. | TBD | TBD |
| NoSQL Database | Elasticsearch | Extracts documents stored in an Elasticsearch index, using a query to filter the documents to extract. | TBD | TBD |
| Identity Providers | Group Expansion | Given a list of users with group memberships, recursively expands the group membership information to compute the complete list of group memberships for any given user. | TBD | 5.0, Yes |
| Identity Providers | LDAP | Retrieves users, groups and memberships from any LDAP server. | TBD | TBD |
| Identity Providers | Atlassian Confluence Identities | Retrieves users, groups and memberships stored in Confluence repositories. | TBD | TBD |
| Identity Providers | Azure Active Directory | Retrieves users, groups and memberships from Azure Active Directory. | TBD | TBD |
| Other | Exchange | Extracts content from Exchange servers, including Mail (and attachments), Calendar and Contacts. | TBD | TBD |
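The Group Expansion entry describes computing each user's complete set of groups from direct memberships. A minimal sketch of that idea follows, assuming memberships are given as a mapping from each principal (user or group) to the groups it directly belongs to. The function name `expand_groups` and the input shape are hypothetical illustrations, not part of the Aspire connector's API, and the sketch uses an explicit stack in place of recursion so that cyclic group nesting cannot loop forever.

```python
def expand_groups(direct_memberships):
    """Compute the transitive group memberships of every principal.

    direct_memberships: dict mapping a principal (user or group name) to the
    set of groups it is a direct member of. This shape is an assumption made
    for illustration only.
    """
    expanded = {}
    for principal in direct_memberships:
        seen = set()
        # Start from the principal's direct groups and walk upwards.
        stack = list(direct_memberships.get(principal, ()))
        while stack:
            group = stack.pop()
            if group in seen:
                continue  # already visited; also guards against cycles
            seen.add(group)
            # A group may itself be a member of further groups.
            stack.extend(direct_memberships.get(group, ()))
        expanded[principal] = seen
    return expanded


# Hypothetical sample data: users and nested groups.
direct = {
    "alice": {"engineering"},
    "bob": {"contractors"},
    "engineering": {"staff"},
    "staff": {"everyone"},
    "contractors": {"everyone"},
}

expanded = expand_groups(direct)
print(sorted(expanded["alice"]))  # ['engineering', 'everyone', 'staff']
print(sorted(expanded["bob"]))    # ['contractors', 'everyone']
```

Tracking visited groups in `seen` keeps the traversal linear in the number of membership edges reachable from each principal and makes nested groups that reference each other safe to process.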