Page History
Performance and Reliability
- Distributed processing and automatic threading
- The ability to split document processing jobs into sub jobs that can run in parallel
- Standard technology for managing and restarting processes on servers for high availability
- Can be placed within an architecture for Backup Failover
Ease of Administration
- Making dynamic (on-the-fly) configuration changes
- Dynamically adding new components
- Dynamic refresh of component code
- Rich built-in XML processing methods including XPath and XSLT
- Hierarchical component configuration
- Rich and comprehensive web-based administration and control interface
Strong Developer Environment
- Intuitive workflow interface
- Supports processing content in diverse languages
- Easy mapping of document fields to search fields
- Rich built-in JSON and XML processing methods, including XPath, XSLT
- Use of scripting to build complex processing components
- Hierarchical component configuration
- Tightly integrated with Maven repositories for sharing and loading component code
- Sharing and loading component code
- Process streams of tokens, for performing text analytics
- Entity extraction
- Latent Semantic Analysis
- Document vector creation and comparison
- Topic Analysis
Support for Security
- Handle Proxy LDAP requests, including:
- Authenticating users
- Determining user group membership across a multitude of systems
- Handle Proxy LDAP requests, including:
Support to Federate Search Requests
- Distribute queries to multiple search engines
- Merge search results
Support for Hadoop
- Ability to write to HDFS
- Ability to include Aspire within Map/Reduce jobs
Overview
Content Tools