You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Current »

Accepts Java regular expressions then tries to match those expressions against the tokens coming in the pipeline. When there is a match the text will be tagged with a Semantic Tag.  In case you need the regex to match several tokens you can use the Regex Recognizer, just be aware that it is heavier on processing.

Configuration

This stage does not require a specific configuration other then the basic.

Adding a Regex

Click on the  button to open the "Add new regex" dialog


  • Regex ( type=string | required ) - Regular expression
  • Options
    • Case Insensitive ( type=boolean | optional ) - Set to true if the pattern is not case sensitive.
    • Literal ( type=boolean | default=false | optional ) - When this flag is specified then the input string that specifies the pattern is treated as a sequence of literal characters. Metacharacters or escape sequences in the input sequence will be given no special meaning.
  • Confidence Adjustment ( type=double | default=1 | required ) - Adjustment factor to apply to the confidence value of 0.0 to 2.0 from (Applies for every pattern match).
    • 0.0 to < 1.0  decreases confidence value
    • 1.0 confidence value remains the same
    • > 1.0 to  2.0 increases confidence value


General Settings

The general settings can be accessed by clicking on

More settings could be displayed in the same dialog, it varies per recognizer.


  • Enable - Enable the processor to be use in pipelines.
  • Base Pipeline - Indicates the last stage, from a pipeline, needed by the recognizer.
  • Skip Flags ( optional ) - Lexical items flags to be ignored by this processor.
  • Boundary Flags  ( optional ) - List of vertex flags that indicate the beginning and end of a text block.
  • Required Flags ( optional ) - Lexical items flags required by every token to be processed.
  • At Least One Flag ( optional ) - Lexical items flags needed by every token to be processed.
  • Don't Process Flags ( optional ) - List of lexical items flags that are not processed. The difference with "Skip Flags" is that this will drop the path in the Saga graph, skip just skips the token and continues in the same path.
  • Confidence Adjustment - Adjustment factor to apply to the confidence value of 0.0 to 2.0 from (Applies for every match).
    • 0.0 to < 1.0  decreases confidence value
    • 1.0 confidence value remains the same
    • > 1.0 to  2.0 increases confidence value
  • Debug - Enable debug logging.

  • No labels