The Elasticsearch Cache Lookup Content Type Detector App Bundle is a workflow component for Aspire.

Content Type Detector
Factory Name	com.accenture.aspire:content-type-detector
subType	job-input
Inputs	The field from which you want to get the value and a field to be created in the Aspire document.
Outputs	Aspire object that contains a subjob with metadata and checked out content extracted from the file.

Configuration

This section lists all configuration parameters available to configure the Content Type Detector component.

	Element	Type	Default	Description
General	Ignore Delete Jobs	boolean	True	Option to skip delete jobs.
	Fetch file	boolean	False	Select if you need to fetch a file.
	Use default document path	boolean	True	Select so that Aspire will use the fetchUrl or displayUrl as the location of the file.
	Document fetch path	None	None	Location in the Aspire document of the path to the file to fetch.
	Max Lookahead in MBytes for type detection	text	No	Maximum to consume the file stream to detect the type.
	Max percent of column variability to allow in text separated files	text	No	Maximum percentage of variability to allow in the number of columns.
	Apache Tika configuration path	text	No	Path for Apache Tika configuration file.

Example

Configuration

Code Block

theme	RDark
title	PUT aspire/_api/credentials/2a5ca234-e328-4d40-bb2a-2df3e550b065

"General":[
    {         "ignoreDeleteJobs": true,
        "enableFetchUrl": true,
        "defaultFetchPath": true,
        "fetchPath": "/doc/fetchUrl",
        "maxLookaheadSize": 0.5,
        "variabilityPercent": 0,
        "tikaConfig": "/path/to/tikaConfig.xml"
     }
],

Page tree

Versions Compared

Old Version 8

New Version Current

Key

Configuration

Example

Configuration

Page tree

Page History

Versions Compared

Old Version 8

New Version Current

Key

Configuration

Example

Configuration