Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

This recognizer works in a similar way to the Entity Recognizer in the sense that looks up sequences of tokens in a dictionary to match the text being processed. The difference is that it will also include in the matching text N tokens to the right and/or left of the original matched text. 

Info

Uses Token Matcher Recognizer Stage

Configuration

  • Parameter
    summaryWhen true, the recognizer will create internal tags to tag the left part, the matched text and the right part
    defaultfalse
    nameGroup tokens in its own tag
    typeboolean
  • Parameter
    summaryInternal Tag name for the matched text portion
    default_matchedTokens_
    nameMatched tokens tagname
  • Parameter
    summaryInternal Tag name for the tokens at the left of the matched text
    default_leftTokens_
    nameLeft tokens tag name
  • Parameter
    summaryInternal Tag name for the tokens at the right of the matched text
    default_rightTokens_
    nameRight tokens tag name


General Settings

Include Page
Generic Recognizer Config
Generic Recognizer Config


Adding a Pattern

Click on the Image Removedthe Image Added button to open the "Add new Pattern" dialog

Image RemovedImage Added


  • Parameter
    summarySpace separated list of tags and/or tokens to be matched
    namePattern
    requiredtrue
  • Options
    • Parameter
      summaryNumber of tokens to take to the left of the matched text
      default0
      nameMax tokens to the left
      typeinteger
    • Parameter
      summaryNumber of tokens to take a the right of the matched text
      default3
      nameMax tokens to the right
      typeinteger
  • Parameter
    summaryAdjustment factor to apply to the confidence value of 0.0 to 2.0 from (Applies for every pattern match).
    default1
    nameConfidence Adjustment
    typedouble
    requiredtrue
    • 0.0 to < 1.0  decreases confidence value
    • 1.0 confidence value remains the same
    • > 1.0 to  2.0 increases confidence value


Image RemovedImage Added