Looks up sequences of tokens in a dictionary and then tags the sequence with one or more semantic tags as an alternative representation. Typically, these tags represent entities such as {person}, {place}, {company}, etc.

Info: Uses the Dictionary Tagger Stage.
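
To make the lookup described above concrete, here is a minimal sketch in Python. It is not the Dictionary Tagger Stage's implementation: the `DICTIONARY` contents, the `tag_sequences` function, and the `max_len` limit are hypothetical names chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical dictionary: token sequences mapped to one or more semantic tags.
DICTIONARY: Dict[Tuple[str, ...], List[str]] = {
    ("abraham", "lincoln"): ["{person}"],
    ("washington",): ["{person}", "{place}"],
}


def tag_sequences(tokens: List[str], max_len: int = 3) -> List[Tuple[int, int, List[str]]]:
    """Return (start, end, tags) for every token span found in the dictionary."""
    lowered = [token.lower() for token in tokens]
    matches = []
    for start in range(len(lowered)):
        # Try every span of up to max_len tokens beginning at `start`.
        last_end = min(start + max_len, len(lowered))
        for end in range(start + 1, last_end + 1):
            tags = DICTIONARY.get(tuple(lowered[start:end]))
            if tags:
                matches.append((start, end, tags))
    return matches


matches = tag_sequences(["Abraham", "Lincoln", "visited", "Washington"])
```

In this sketch a single span can carry several tags, which mirrors the "one or more semantic tags" wording above; the original tokens are left untouched and the tags act as an alternative representation of the span.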

Configuration


  • Ignore Tags (string array)
    List of tags to be ignored by the recognizer.
  • Remove Accents & Diacritics (boolean, default: unchecked)
    Replaces characters such as "á, ö, ç" with their normalized forms "a, o, c".
  • Remove Characters (boolean, default: unchecked)
    Replaces the specified characters with white space.
    • Characters to remove (default: _-‿⁀⁔︳︴﹍﹎﹏_)
      Only used if Remove Characters is checked.
  • Use Spellchecking (boolean, default: unchecked)
    Activates spellchecking so that misspelled entities are recognized.
  • Vector Cosine Similarity Acceptance (double, default: 0.7)
    Cosine similarity factor used when accepting entities.
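
The normalization and acceptance options above can be sketched as follows. This is an illustration under stated assumptions, not the recognizer's actual code: the function names are hypothetical, accent removal is assumed to be Unicode NFD decomposition with combining marks dropped, and acceptance is assumed to mean "cosine similarity greater than or equal to the configured value". The page does not say which vectors are compared, so plain number lists stand in for them.

```python
import math
import unicodedata
from typing import Sequence

# Default copied from the "Characters to remove" parameter.
DEFAULT_CHARS_TO_REMOVE = "_-‿⁀⁔︳︴﹍﹎﹏_"


def remove_accents(text: str) -> str:
    """Decompose characters (NFD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def remove_characters(text: str, chars: str = DEFAULT_CHARS_TO_REMOVE) -> str:
    """Replace each listed character with a single space."""
    return text.translate({ord(ch): " " for ch in chars})


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def accept(a: Sequence[float], b: Sequence[float], threshold: float = 0.7) -> bool:
    # Assumption: a match is accepted when similarity >= threshold.
    return cosine_similarity(a, b) >= threshold
```

For example, `remove_accents("áöç")` yields `"aoc"`, and `remove_characters("new_york")` yields `"new york"`.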

Adding an Entity

Click the button that opens the "Add new Entity" dialog.


  • Patterns (string array, required)
    Patterns to look for.
  • ID (required, default: autogenerated)
    ID assigned to the set of patterns.
    • Normally used to match the ID in a database or a key for an API.
  • Display (required)
    Display value used for normalization.
  • Confidence Adjustment (double, required, default: 1)
    Adjustment factor from 0.0 to 2.0 applied to the confidence value (applies to every pattern match).
    • 0.0 to < 1.0 decreases the confidence value
    • 1.0 leaves the confidence value unchanged
    • > 1.0 to 2.0 increases the confidence value
  • The final space is for additional custom configurations.
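
As an illustration of the entity fields listed above, the sketch below models one entry and applies its Confidence Adjustment. The `EntityEntry` class and `adjust_confidence` function are hypothetical, and the adjustment is assumed to be a simple multiplication, which is consistent with the ranges described (below 1.0 decreases, 1.0 leaves unchanged, above 1.0 increases) but is not confirmed by this page.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EntityEntry:
    """Hypothetical model of one entry in the "Add new Entity" dialog."""
    patterns: List[str]
    id: str
    display: str
    confidence_adjustment: float = 1.0

    def __post_init__(self) -> None:
        if not self.patterns:
            raise ValueError("at least one pattern is required")
        if not 0.0 <= self.confidence_adjustment <= 2.0:
            raise ValueError("confidence_adjustment must be between 0.0 and 2.0")


def adjust_confidence(confidence: float, entry: EntityEntry) -> float:
    # Assumption: the factor is applied multiplicatively to each pattern match.
    return confidence * entry.confidence_adjustment


entry = EntityEntry(
    patterns=["NYC", "New York City"],
    id="city-001",  # e.g. a database key
    display="New York City",
    confidence_adjustment=1.5,
)
boosted = adjust_confidence(0.5, entry)
```

Here every pattern in the set shares the same ID and Display value, so any of the patterns normalizes to the same entity.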

General Settings

See the Generic Recognizer Config page.