Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Easy Heading Free
navigationTitleOn this Page
wrapNavigationTexttrue
navigationExpandOptionexpand-all-by-default

The Elasticsearch Cache Lookup Content Type Detector App Bundle is a workflow component for Aspire.
                                                     Content Type Detector
Factory Namecom.accenture.aspire:content-type-detector
subTypejob-input
InputsThe field from which you want to get the value and a field to be created in the Aspire document.
OutputsAspire object that contains a subjob with metadata and checked out content extracted from the file.

Configuration


This section lists all configuration parameters available to configure the Content Type Detector component.

ElementTypeDefaultDescription
GeneralIgnore Delete JobsbooleanTrueOption to skip delete jobs.
Fetch filebooleanFalseSelect if you need to fetch a file.
Use default document pathbooleanTrueSelect so that Aspire will use the fetchUrl or displayUrl as the location of the file.
Document fetch pathNoneNoneLocation in the Aspire document of the path to the file to fetch.
Max Lookahead in MBytes for type detectiontextNoMaximum to consume the file stream to detect the type.
Max percent of column variability to allow in text separated filestextNoMaximum percentage of variability to allow in the number of columns.
Apache Tika configuration pathtextNoPath for Apache Tika configuration file.

Example

Configuration

Configuration 

Code Block
themeRDark
titlePUT aspire/_api/credentials/2a5ca234-e328-4d40-bb2a-2df3e550b065
"General":[
    {
         "ignoreDeleteJobs": true,
        "enableFetchUrl": falsetrue,
        "defaultFetchPath": true,
        "fetchPath": "/doc/fetchUrl",
        "maxLookaheadSize": 0.5,
        "variabilityPercent": 0,
        "tikaConfig": "/path/to/tikaConfig.xml" 
      }
],