Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

This stage maintains a list tokens used to identify possible subjects of interest and suggest a URL reference along with "title" and "description". The title and description fields are used as display data.

Note

This stage extends from the Dictionary Tagger Stage.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below.

Saga_is_recognizer

Note

This stage extends from the Dictionary Tagger Stage.

Include Page
Generic Configuration Parameters
Generic Configuration Parameters

...

Saga_config_stage
boundaryFlagstext block split
requiredFlagstoken, semantic tag
titleDefault Config
skipFlagsskipstop word
"partialMatchPercent": 50

Example Output

...

Saga_graph
V-------[Welcome to Accenture.]--------V 
^-----[Welcome to Accenture]------V-[]-^ 
^-[Welcome]-V-[to]-V-[Accenture]--^      
^-[welcome]-^      ^-[accenture]--^      
                   ^-[{bestbets}]-^      

...

  • BESTBET - Identifies that the token as a possible reference to a subject to which Saga has a link for.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.

Vertex Flags:

Info

No vertices are created in this stage


Resource Data

The pattern database is a series of JSON records, typically indexed by "pattern block ID".  Each JSON record represents a block of patterns (one or more) that all produce the same semantic tag.  The format is as follows:

...