
Detects the specified synonyms and generates a normalized token.


Operates On:  Lexical Items with TOKEN

...

and possibly other flags as specified below.

Recognizer: false

...

  • synonyms (required): The resource which contains the synonyms database.
Stage Config (stage: SynonynStage; boundary flags: text block split)

"synonyms": "saga_provider:synonym_patterns"

Example Output

Description

V--------------[I'm the LUMINARY of this airbuscustomer]--------------V 
^---[I'm]---V-[the]-V-[LUMINARY]-V-[of]-V-[this]-V--[airbuscustomer]--^ 
^-[I]-V-[m]-^       ^--[cheese]--^               ^-[airplane]-^ 
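
The graph above shows a normalized token added as an alternative alongside the token it matched. The Python sketch below illustrates that idea only; it is not Saga's implementation. The function name `add_norm_tokens` and the in-memory entry mapping "luminary" to "cheese" are hypothetical, and the entry layout (`norm`, `synonyms`, `options.caseInsensitive`) is assumed from the resource format described on this page.

```python
def add_norm_tokens(tokens, entries):
    """Pair each token with the norm of the first entry it matches, else None.

    Hypothetical helper: illustrates synonym normalization, not the Saga API.
    """
    result = []
    for token in tokens:
        norm = None
        for entry in entries:
            # Assumption: caseInsensitive defaults to true when options are absent.
            case_insensitive = entry.get("options", {}).get("caseInsensitive", True)
            if case_insensitive:
                matched = token.lower() in (s.lower() for s in entry["synonyms"])
            else:
                matched = token in entry["synonyms"]
            if matched:
                norm = entry["norm"]
                break
        result.append((token, norm))
    return result


# Hypothetical entry, chosen to mirror the LUMINARY -> cheese branch in the graph.
entries = [{"norm": "cheese", "synonyms": ["luminary"]}]
pairs = add_norm_tokens(["the", "LUMINARY", "of"], entries)
```

Tokens with no matching synonym keep `None` as their norm, which corresponds to positions in the graph that have no alternative branch.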

...

The resource data is a database of synonym patterns and the resulting tokens they produce.

Resource Format

Entity JSON Format

{
  "_id": "Q28260KGAAJGsBemSwA0nZTLXA",
  "norm": "red",
  "synonyms": [
    "cardinal", "coral", "crimson", "flaming", "glowing", "maroon", "rose", "wine", "red"
  ],
  "options": {
    "caseInsensitive": true
  },
  "confAdjust": 0.95

  . . . additional fields as needed go here . . .
}
Note: Additional fielded data can be added to the record as needed by downstream processes.

Fields

  • norm (required): The normalized version of the word. If any token matches one of the synonyms, the norm becomes the new token that is created.
  • synonyms (string array, required): Possible tokens to be found in the graph.
  • confAdjust (number, required): Specifies the confidence level of the entry, independent of any patterns matched.
    This is the confidence of the entry relative to all of the other entries; essentially, the likelihood that this entity will be randomly encountered.
  • options (json, default: empty json): Options for each set of synonyms.
    • caseInsensitive (boolean, default: true): Indicates whether the search for synonyms is case insensitive.
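
The field rules above can be checked before an entry is loaded into the resource. The following is a minimal Python sketch under stated assumptions: `validate_entry` is a hypothetical helper and not part of Saga, and `confAdjust` is treated as numeric because the example entry uses 0.95.

```python
def validate_entry(entry):
    """Return a list of problems found in one synonym entry (empty if none).

    Hypothetical helper for illustration; not part of the Saga API.
    """
    problems = []
    if not isinstance(entry.get("norm"), str):
        problems.append("norm must be a string")
    synonyms = entry.get("synonyms")
    if not (isinstance(synonyms, list) and all(isinstance(s, str) for s in synonyms)):
        problems.append("synonyms must be an array of strings")
    conf = entry.get("confAdjust")
    # Assumption: confAdjust is a number (bool is excluded explicitly,
    # since bool is a subclass of int in Python).
    if isinstance(conf, bool) or not isinstance(conf, (int, float)):
        problems.append("confAdjust must be a number")
    options = entry.get("options", {})  # optional; defaults to empty json
    if not isinstance(options, dict):
        problems.append("options must be a json object")
    elif not isinstance(options.get("caseInsensitive", True), bool):
        problems.append("caseInsensitive must be a boolean")
    return problems


entry = {
    "_id": "Q28260KGAAJGsBemSwA0nZTLXA",
    "norm": "red",
    "synonyms": ["cardinal", "coral", "crimson", "flaming", "glowing",
                 "maroon", "rose", "wine", "red"],
    "options": {"caseInsensitive": True},
    "confAdjust": 0.95,
}
problems = validate_entry(entry)
```

Extra fields such as `_id` are ignored by the check, in line with the note that additional fielded data can be added to a record.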


...

  • updatedAt (date epoch, required): Date, in milliseconds, of the last time the entry was updated.

Generic Resource Fields

...