Detects the specified synonyms and generates a normalized token.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below.

Recognizer: false

...

  • synonyms (required)
    The resource which contains the synonyms database.
Stage Config
stage: SynonynStage
boundaryFlags: text block split
"synonyms": "saga_provider:synonym_patterns"

Example Output

Description

V--------------[I'm the LUMINARY of this airbuscustomer]--------------V 
^---[I'm]---V-[the]-V-[LUMINARY]-V-[of]-V-[this]-V--[airbuscustomer]--^ 
^-[I]-V-[m]-^       ^--[cheese]--^               ^-[airplane]-^ 
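
The example graph above suggests that a matched synonym does not replace the original token; instead, a normalized token is added that spans the same vertices. The following is a minimal Python sketch of that idea, not Saga's actual graph API. The `Token` class, the `add_synonym_tokens` function, and the character-offset positions are hypothetical, and the `LUMINARY` to `cheese` mapping is simply read off the example output.

```python
from typing import Dict, List, NamedTuple


class Token(NamedTuple):
    """Hypothetical stand-in for a lexical item between two vertices."""
    start: int  # position of the vertex where the token begins
    end: int    # position of the vertex where the token ends
    text: str


def add_synonym_tokens(tokens: List[Token], norms: Dict[str, str]) -> List[Token]:
    """Return the normalized tokens that would be added alongside the originals.

    `norms` maps a lower-cased synonym to its normalized form (assumed shape).
    """
    alternatives = []
    for token in tokens:
        norm = norms.get(token.text.lower())
        # Only add an alternative when the norm differs from the original text.
        if norm is not None and norm != token.text:
            alternatives.append(Token(token.start, token.end, norm))
    return alternatives


# Tokens for "I'm the LUMINARY of this airbuscustomer", using character offsets.
sentence_tokens = [
    Token(0, 3, "I'm"),
    Token(4, 7, "the"),
    Token(8, 16, "LUMINARY"),
    Token(17, 19, "of"),
    Token(20, 24, "this"),
    Token(25, 39, "airbuscustomer"),
]

added = add_synonym_tokens(sentence_tokens, {"luminary": "cheese"})
```

Here `added` holds a single `cheese` token with the same start and end as `LUMINARY`, so downstream stages could follow either path through the graph.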

...

Entity JSON Format
"_id": "Q28260KGAAJGsBemSwA0nZTLXA",
"norm": "red",
"synonyms":[
   "cardinal", "coral", "crimson", "flaming", "glowing", "maroon", "rose", "wine", "red"
],
"options" : {
   "caseInsensitive" : true
},
"confAdjust": 0.95

. . . additional fields as needed go here . . . 
Note: Additional fielded data can be added to the record as needed by downstream processes.

Fields

  • norm (required)
    The normalized version of the word. If any token matches one of the synonyms, a new token is created with the norm as its text.
  • synonyms (string array, required)
    Possible tokens to be found in the graph.
  • confAdjust (required)
    Specifies the confidence level of the entry, independent of any patterns matched.
    • This is the confidence of the entry, in comparison to all of the other entries. Essentially, the likelihood that this entity will be randomly encountered.
  • options (json, default: empty json)
    Options for each set of synonyms.
    • caseInsensitive (boolean, default: true)
      Indicates whether the search for synonyms must be case insensitive.
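
To make the interaction between the fields above concrete, here is a minimal Python sketch of how entries of this shape could be indexed and queried. It is an illustration under stated assumptions, not the stage's implementation: `build_index` and `lookup` are hypothetical helpers, and the only behavior taken from this page is that `options` defaults to an empty JSON object and `caseInsensitive` defaults to true.

```python
from typing import Dict, List, Optional, Tuple

# (norm, confAdjust) pair returned for a matched synonym.
Match = Tuple[str, float]
Index = Tuple[Dict[str, Match], Dict[str, Match]]


def build_index(entries: List[dict]) -> Index:
    """Index entries by synonym (hypothetical helper, not part of Saga)."""
    exact: Dict[str, Match] = {}   # synonyms that must match case-sensitively
    folded: Dict[str, Match] = {}  # synonyms matched after lower-casing
    for entry in entries:
        options = entry.get("options", {})                       # default: empty json
        case_insensitive = options.get("caseInsensitive", True)  # default: true
        target = folded if case_insensitive else exact
        for synonym in entry["synonyms"]:
            key = synonym.lower() if case_insensitive else synonym
            target[key] = (entry["norm"], entry["confAdjust"])
    return exact, folded


def lookup(index: Index, token: str) -> Optional[Match]:
    """Return (norm, confAdjust) for a token, or None if nothing matches."""
    exact, folded = index
    if token in exact:
        return exact[token]
    return folded.get(token.lower())


# The sample entry from this page, reduced to the fields used here.
red_entry = {
    "norm": "red",
    "synonyms": ["cardinal", "coral", "crimson", "flaming", "glowing",
                 "maroon", "rose", "wine", "red"],
    "options": {"caseInsensitive": True},
    "confAdjust": 0.95,
}

index = build_index([red_entry])
match = lookup(index, "Crimson")
```

With the sample entry, `match` is the pair of the entry's `norm` and `confAdjust`, because `Crimson` is lower-cased before comparison. Keeping separate exact and folded tables is one way to let case-sensitive and case-insensitive entries coexist in the same resource.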


...

  • updatedAt (date epoch, required)
    Date in milliseconds of the last time the entry was updated.

Generic Resource Fields

...