The Workflow is the successor of the Routing Table. With this feature you can add, remove and change information, create branches in the flow of information, publish to different search engines, and more.

The Workflow is divided into two parts: the Workflow Library and the Workflow Tree. For more information, please see Workflow Section.

Workflow Library



In the Workflow Library you can find all the applications and rules needed to create your own business rules just by dragging and dropping them into the Workflow Tree. They are all stored in a collapsible menu divided by categories. The Workflow has two types of rules:

  • Application Rules
    • Downloadable Java components (Publishers, Applications and Big Data)
  • Script Rules
    • Predefined templates of Groovy scripts (Folder, Base Functions, Choices)

       
Note

The available Publishers, Applications and Big Data may change according to your entitlements.

 

Categories

Both application rules and scripting rules are divided into categories. The sections below describe each category and what you can expect to find in it.

Folder

The folder is used to group a set of rules into one branch which later can be moved, shared or disabled.

Publishers

With the Publishers you can publish the jobs coming from the connector into a search engine. Each publisher must be configured before it is added as an application rule. They are all downloadable Java applications.

Applications

The Applications are used to modify the jobs coming from the connector in more complex ways than a simple Groovy script could. They are all downloadable Java applications.

Big Data

The Big Data category holds all the big data related applications such as Co-occurrence, Hadoop Job Launcher, among others.

Base Functions

The Base Functions are scripting rules based on Groovy that can perform simple functions on the jobs. This category also has rules that can skip or stop the processing of a job in the workflow.

You can also create your own custom rules.

Choices

The Choices are scripting rules based on Groovy that allow you to create branches in the workflow based on a decision. These branches are only executed by the jobs that comply with the decision.

Local Library

The Local Library category holds all the rules used in the current workflow, including the shared rules. Dragging a rule from this category only creates a reference to the existing rule.


Share Libraries

The Shared Libraries are sets of rules and applications saved for use with other content sources. If a rule or application is not in a shared library, it cannot be used again in another content source.

How to Share a Rule or Application?

This section walks through the steps necessary to share any rule or application using the context menu.

Step 1: Open the Context Menu

  1. Right click the rule or application you want to share. If it can be shared, the Share option should be enabled.
  2. Put the mouse over the Share option; this should open the sub menu.
  3. Click the New Library option; this will open a window where you can enter the name of the library.

Step 2: Create the Share Library

  1. Enter the name of the library.
  2. Click the Share button; this will create the share library with the rule or application inside it.


Step 2.1: Check the Share Library
  1. Go to the Workflow Library; the new Libraries category should now be visible.
  2. Click the Libraries category to show all the shared libraries available; your new library should be visible.
  3. Click your library to show the shared rules it contains.


Tip

You can delete a shared rule by clicking the trashcan next to the name of the rule in the Workflow Library.

Note

The share library may change position if you have several shared libraries, because they are sorted in alphabetical order.

Step 3: Share to an existing Share Library

If you already have a share library and you want to add a new rule or application to it, repeat Step 1 and, instead of clicking New Library, click the name of your library. This will add the rule or application automatically.

Step 4: Unshare a Rule or Application

  1. Right click the rule or application you want to unshare. If it is shared, the Unshare option should be enabled.
  2. Click the Unshare option; this will remove the rule from the shared library.

Workflow Trees


The Workflow Trees represent specific points in the information flow. They are stages in which the jobs are processed before they are sent to the next one. The root node of each tree is always named after the stage it represents. There are five trees for each content source:

  • After Scan: 
    • Process documents before their content is fetched. Typically used to terminate jobs to avoid fetching unwanted documents.
  • On Add/Update: 
    • Process documents to be added or updated in the index. Typically used to map/normalize metadata fields and values.
  • On Delete: 
    • Delete events go through this workflow. Typically empty, this can be used to update an external repository as well.
  • On Error: 
    • Any job which encounters an error goes through this workflow. Could be used to log the error or quarantine the document.
  • On Publish: 
    • Publish documents to a search engine. Put your publisher in this workflow.

* The workflow trees are saved with every change you make; they are independent of the Save button.



Controls


[Image: Workflow controls]

Rules Hierarchy Restrictions


Some rules are restricted in which rules they can contain. The list below shows which types of rules each container can hold.

Root

  • Folder
  • Publishers
  • Applications
  • Big Data
  • Base Functions
  • Choices (but not the Condition)

Folder

  • Folder
  • Publishers
  • Applications
  • Big Data
  • Base Functions
  • Choices (but not the Condition)

Condition

  • Folder
  • Publishers
  • Applications
  • Big Data
  • Base Functions
  • Choices (but not the Condition)

Choices

  • Condition (Only)
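The restrictions in this section can be read as a simple lookup table. The sketch below is only an illustration of that reading, not Aspire code: the names `ALLOWED_CHILDREN` and `can_contain` are hypothetical, and it assumes that Root, Folder and Condition accept the same set of categories while Choices accepts only Condition.

```python
# Hypothetical model of the containment rules; not part of Aspire.
GENERAL_RULES = {
    "Folder", "Publishers", "Applications",
    "Big Data", "Base Functions", "Choices",
}

ALLOWED_CHILDREN = {
    "Root": GENERAL_RULES,
    "Folder": GENERAL_RULES,
    "Condition": GENERAL_RULES,
    "Choices": {"Condition"},  # a Choice can only hold Conditions
}


def can_contain(parent: str, child: str) -> bool:
    """Return True if a rule of type `parent` may hold a rule of type `child`."""
    return child in ALLOWED_CHILDREN.get(parent, set())
```

With this table, `can_contain("Choices", "Condition")` is true, while `can_contain("Root", "Condition")` is false because a Condition can only live under a Choice.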

    Add a Rule or Application to the Workflow Tree

    This section walks through the steps necessary to add any rule or application to the Workflow Tree.

    Step 1: Drag from the Library

    You can add any rule or application just by dragging it from the Workflow Library and dropping it where you want it in the Workflow Tree.

    Step 2: Fill the properties

    If the rule needs to be configured, a window opens with all the configuration properties; otherwise the rule just appears in the tree. If a window opens, fill in the necessary properties and click the Add button to create the rule.

    Step 2a: Update the properties

    By double clicking the rule in the tree, you can open a window with the current properties of the rule. You can then change the properties and save them by clicking the Update button.

    Context Menu


    The Context Menu has several options to manage the business rules:

    • Edit: Same functionality as double click. Use this option in cases where double click doesn't work, such as a disabled rule.
    • Cut: Cut the current rule.
    • Copy: Copy the current rule.
    • Paste:
      • Pasting from a Cut: It will just move the reference of the rule.
      • Pasting from a Copy: It will create a new rule with the same properties, and add a reference to the new rule.
    • Paste Reference: (Enabled only after a copy) Creates a reference to the copied rule, which means that if the rule is updated from the reference or from the original rule, all the references pointing to that rule will show the same changes.
    • Delete: Deletes the reference to the rule.
      • If the rule isn't in a Share Library or it doesn't have another reference in the same or in another tree, it will also delete the rule.
    • Disable/Enable: Disables or enables the current rule or reference.
      • If the reference is disabled it will be grayed out.
      • If the disabled reference has children, the whole branch will be inaccessible.
    • Share / Unshare: (Share if the rule is not shared, Unshare if it is shared).
      • If it is not shared, opens a sub-menu with the options to share.
        • New Library: Opens a window where you can specify the name of the new Share Library.
        • Other Libraries: (If any) Displays the names of the existing Share Libraries where you can put the rule.
      • If it is shared, removes the rule from the library, creates the rule in the current workflow and redirects the reference to that new rule.
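The difference between pasting a copy and pasting a reference can be pictured with ordinary object references. The snippet below is a minimal sketch of that idea, not Aspire's implementation: the `Rule` class and the property names are hypothetical.

```python
import copy


class Rule:
    """Hypothetical stand-in for a business rule with editable properties."""

    def __init__(self, name, properties):
        self.name = name
        self.properties = properties


original = Rule("Set String Value", {"field": "title", "value": "draft"})

# Paste Reference: the tree gets another pointer to the same rule.
reference = original

# Paste after Copy: a new rule is created with the same properties.
duplicate = copy.deepcopy(original)

# Updating the original rule is visible through the reference only.
original.properties["value"] = "final"
```

After the update, `reference.properties["value"]` reads `"final"` while `duplicate.properties["value"]` is still `"draft"`, which mirrors the behavior described for Paste Reference and Paste.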

    Custom Applications and Custom Script


    Custom Applications

    This includes the Publishers (all related information can be found on this page), the Applications, which at the moment include the Hierarchy Extractor and the Mime Type Normalizer, and Big Data.

    By dragging and dropping the Custom option of Publishers or Applications, you open a window from which you can choose between two methods to install a custom application, publisher or big data: repository and configuration file. Both are shown as tabs at the top of the window.

    Repository

    The repository method is always the default one. With this option you can download the custom application, publisher or big data from a Maven repository. To install the custom application, publisher or big data you need to fill in the following fields:

    1. Group Id: e.g. com.searchtechnologies.aspire
    2. Artifact Id: The id of the artifact representing the connector, e.g. app-custom-connector
    3. Version: (Optional) If the version of the artifact isn't specified, Aspire will use its own version.

    After filling in all the necessary fields, click Continue and the application will be loaded in the same window.

    Note
    • All the Applications, Publishers and Big Data added using this method will be added to their respective category in the Workflow Library.
    • It is not recommended to use an older version of anything if a new version is available.
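The three repository fields correspond to standard Maven coordinates, with the version falling back to the server's own version when left empty. The following sketch only illustrates that fallback; the class `ArtifactCoordinates`, its methods, and the version string passed in are all hypothetical, not Aspire APIs.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ArtifactCoordinates:
    """Hypothetical holder for the Group Id, Artifact Id and Version fields."""

    group_id: str
    artifact_id: str
    version: Optional[str] = None  # optional, as in the form

    def resolved_version(self, aspire_version: str) -> str:
        # Fall back to the running Aspire version when none was given.
        return self.version or aspire_version

    def gav(self, aspire_version: str) -> str:
        """Return the coordinates in the usual group:artifact:version form."""
        version = self.resolved_version(aspire_version)
        return f"{self.group_id}:{self.artifact_id}:{version}"


coords = ArtifactCoordinates("com.searchtechnologies.aspire", "app-custom-connector")
```

Calling `coords.gav("2.0")` here yields the coordinates with `2.0` filled in as the version, because the Version field was left empty; the value `"2.0"` is only a placeholder for whatever version the server is running.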

    Configuration Files

    Before you access the configuration file method, an alert is shown indicating that applications and publishers added using this method will not be included in their respective category in the Workflow Library.

    The configuration file method requires both the application file and the dxf file (named in this format: <application>-dxf.xml) to be in the same folder on the server machine. To install the custom application, publisher or big data you need to fill in the following field:

    1. File Path: Enter the path to the application.xml file. It can be a full path to the file or a relative path from the Aspire instance folder (e.g. config/application.xml)

    After filling in the field, click Continue and the application will be loaded in the same window.

    Note

    If the dxf file doesn't have the new valid format for application, publisher or big data, it won't be possible to open the configuration.
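The File Path rules (absolute or relative to the Aspire instance folder, with the dxf file next to the application file) can be sketched as plain path arithmetic. This is an illustration under assumptions: the function names and the `/opt/aspire` folder are hypothetical, and the code only builds paths; it does not check that the files exist.

```python
from pathlib import PurePosixPath


def resolve_application_file(file_path: str, aspire_home: str) -> PurePosixPath:
    """Treat a relative File Path as relative to the Aspire instance folder."""
    path = PurePosixPath(file_path)
    if path.is_absolute():
        return path
    return PurePosixPath(aspire_home) / path


def expected_dxf_file(application_file: PurePosixPath) -> PurePosixPath:
    """Build the companion <application>-dxf.xml path in the same folder."""
    return application_file.with_name(f"{application_file.stem}-dxf.xml")


app_file = resolve_application_file("config/application.xml", "/opt/aspire")
dxf_file = expected_dxf_file(app_file)
```

For the relative path `config/application.xml`, this resolves to `/opt/aspire/config/application.xml` and expects `application-dxf.xml` in the same `config` folder.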

    Custom Script

    By dragging and dropping the Custom option of Base Functions, you open a window where you can add your own Groovy script to manipulate the jobs.

    To add the custom script you need to fill in the following fields:

    1. Name: Add your script's name
    2. Groovy Script: Enter your script

    [Image: Groovy Script with Eclipse Theme]

    [Image: Groovy Script with Twilight Theme]

    Tip
    • You can go into full screen by selecting the text area and pressing the F11 or the Esc key (only the Esc key works on Mac OS X)
    • You can change the text area theme from the Theme pulldown

    Scripting Rules

    Folder

    The Folder is a container where you can put business rules. You can then copy just the folder to another tree or another branch, and it will be copied with all the business rules inside it. If the content of the folder is modified, all its copies will be modified too.

    To create a folder you only need the description.

    Exit

    The Exit rule doesn't have any configurable properties. Its only function is to stop further processing of the job in the tree.

    Job Terminate

    The Job Terminate rule doesn't have any configurable properties. Its function is to terminate the job, which means that no more processing is done on the job at all.
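One way to picture the difference between Exit and Job Terminate is a loop over trees and their rules. The sketch below is a toy model, not how Aspire runs jobs: `run_job`, the rule names and the tree contents are hypothetical, and it assumes that after an Exit the job still moves on to the next tree.

```python
EXIT = "Exit"
TERMINATE = "Job Terminate"


def run_job(trees):
    """Run a job through (tree_name, rules) pairs; return the rules that ran."""
    executed = []
    for tree_name, rules in trees:
        for rule in rules:
            executed.append(f"{tree_name}: {rule}")
            if rule == EXIT:
                break  # skip the remaining rules of this tree only
            if rule == TERMINATE:
                return executed  # no further processing in any tree
    return executed


with_exit = run_job([
    ("After Scan", ["Rule A", EXIT, "Rule B"]),
    ("On Publish", ["Publisher"]),
])
with_terminate = run_job([
    ("After Scan", ["Rule A", TERMINATE, "Rule B"]),
    ("On Publish", ["Publisher"]),
])
```

In this model `with_exit` still reaches the publisher in the second tree, while `with_terminate` stops right after the Job Terminate rule.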

    Raise Exception

    The Raise Exception rule creates an exception in the workflow. The only field needed for this rule is the message of the exception.

    Set String Value

    The Set String Value rule sets the content of a chosen field to a given string. If the field doesn't exist, the rule creates it; if it does exist, the rule overrides its content.
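The create-or-override behavior is the same as assigning to a dictionary key. The sketch below models a job document as a plain `dict`; this representation and the function name `set_string_value` are assumptions for illustration only.

```python
def set_string_value(document: dict, field: str, value: str) -> dict:
    """Create the field if it is missing, otherwise override its content."""
    document[field] = value
    return document


doc = {"title": "Old title"}
set_string_value(doc, "title", "New title")  # existing field is overridden
set_string_value(doc, "source", "intranet")  # missing field is created
```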

    Custom Script

    The Custom rule opens a window where you can enter your own Groovy script and use it as a rule in the workflow. You can also click the text area and press F11 to make it full screen. For this rule you only need the description and the Groovy script you want to use.

    For more information please see Using Groovy.

    Condition

    The Condition is a complement of the Choices: each condition represents a result obtained from the choice. It doesn't have any configurable properties and it can only be hosted by Choices.

    Boolean

    This choice returns a boolean that indicates whether the content of the chosen field equals the expected value. This type of choice can have at most two conditions (true and false).

    Switch

    This choice makes a decision based on whether the content of the field matches one of its conditions. For example, if the chosen field is repType and its possible values are "document", "attachment" and "blog", you can add three conditions with the values "document", "attachment" and "blog".

    For this rule you only need the field you want to test.
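The Switch choice can be thought of as picking the condition whose value equals the field content. The sketch below is a simplified model, not Aspire code: `switch_choice` is a hypothetical name, the document is a plain `dict`, and returning `None` when nothing matches is an assumption, since the text does not say what happens in that case.

```python
from typing import Optional, Sequence


def switch_choice(document: dict, field: str, conditions: Sequence[str]) -> Optional[str]:
    """Return the condition that matches the field content, or None."""
    value = document.get(field)
    for condition in conditions:
        if value == condition:
            return condition
    return None  # assumption: no branch is taken when nothing matches


conditions = ["document", "attachment", "blog"]
branch = switch_choice({"repType": "attachment"}, "repType", conditions)
```

Here `branch` is `"attachment"`, so only the rules under that condition would run for this job.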

    Boolean (Byte array)

    This choice returns a boolean that indicates whether the content of the chosen field equals the expected value (both the value and the content of the field must be byte arrays). This type of choice can have at most two conditions (true and false).

    Switch (Byte array)

    This choice makes a decision based on whether the content of the field matches one of its conditions (both the conditions and the content of the field must be byte arrays).

    For this rule we only need the field we want to test.

    Exclude-By-Name

    This choice checks whether or not the file name matches the pattern entered by the user.

    For this rule, check the matches field if you want the pattern to match the file name; otherwise, leave it unchecked. In the pattern field, enter a regex with the pattern you want to filter.
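The interaction between the pattern and the matches checkbox can be sketched with a regular expression test. This is only an illustration: `name_choice` is a hypothetical function, and using a partial match (`re.search`) rather than a full match is an assumption, since the text does not say which one the rule applies.

```python
import re


def name_choice(file_name: str, pattern: str, matches: bool) -> bool:
    """Return True when the regex result agrees with the `matches` checkbox."""
    found = re.search(pattern, file_name) is not None
    return found == matches


# With `matches` checked, the choice holds for names that fit the pattern.
temp_file_selected = name_choice("report.tmp", r"\.tmp$", matches=True)

# With `matches` unchecked, it holds for names that do not fit the pattern.
pdf_file_selected = name_choice("report.pdf", r"\.tmp$", matches=False)
```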

    Exclude-By-File-Ext

    This choice checks whether or not the file extension matches the pattern entered by the user.

    For this rule, check the matches field if you want the pattern to match the file extension; otherwise, leave it unchecked. In the pattern field, enter a regex with the pattern you want to filter.

    Exclude-By-File-Size

    This choice checks whether the file size is in the range specified by the user. The values of the range must be set in bytes.

    For this rule, set the min and max fields in bytes. A value of -1 in either field means infinite.

    For example, if you set min = 1000 and max = -1, it will filter all files where dataSize is greater than or equal to 1000 bytes.
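The range check with -1 as "no limit" can be written as two independent comparisons. The sketch below is illustrative only: `size_in_range` is a hypothetical name, and treating the max bound as inclusive is an assumption (the example in the text only confirms that the min bound is inclusive).

```python
def size_in_range(data_size: int, min_size: int, max_size: int) -> bool:
    """Check a size in bytes against min/max bounds, where -1 means no bound."""
    above_min = min_size == -1 or data_size >= min_size
    below_max = max_size == -1 or data_size <= max_size  # inclusive by assumption
    return above_min and below_max


# min = 1000, max = -1: every file of at least 1000 bytes is in range.
large_file = size_in_range(5000, 1000, -1)
small_file = size_in_range(999, 1000, -1)
```

In this example `large_file` is true and `small_file` is false, matching the min = 1000, max = -1 case described above.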
