Aspire Features

Created by user-1b188 on May 30, 2018

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Performance and Reliability

- Distributed processing and automatic threading
- The ability to split document processing jobs into sub jobs that can run in parallel
- Standard technology for managing and restarting processes on servers for high availability
- Can be placed within an architecture for Backup Failover

Ease of Administration

- Making dynamic (on-the-fly) configuration changes
- Dynamically adding new components
  - Dynamic refresh of component code
  - Rich built-in XML processing methods including XPath and XSLT
  - Hierarchical component configuration
- Rich and comprehensive web-based administration and control interface

Strong Developer Environment

- Intuitive workflow interface
- Supports processing content in diverse languages
- Easy mapping of document fields to search fields
- Rich built-in JSON and XML processing methods, including XPath, XSLT
- Use of scripting to build complex processing components
- Hierarchical component configuration
- Tightly integrated with Maven repositories for sharing and loading component code
- Sharing and loading component code
- Process streams of tokens, for performing text analytics
  - Entity extraction
  - Latent Semantic Analysis
  - Document vector creation and comparison
  - Topic Analysis

Support for Security

- Handle Proxy LDAP requests, including:
  - Authenticating users
  - Determining user group membership across a multitude of systems

Support to Federate Search Requests

- Distribute queries to multiple search engines
- Merge search results

Support for Hadoop

- Ability to write to HDFS
- Ability to include Aspire within Map/Reduce jobs

No labels