Excerpt |
---|
This stage maintains a list tokens used to identify possible subjects of interest and suggest a URL reference along with "title" and "description". The title and description fields are used as display data. |
Note |
---|
This stage extends from the Dictionary Tagger Stage. |
Operates On: Lexical Items with TOKEN and possibly other flags as specified below.
Note |
---|
This stage extends from the Dictionary Tagger Stage. |
Include Page |
---|
| Generic Configuration Parameters |
---|
| Generic Configuration Parameters |
---|
|
...
Parameter |
---|
summary | Use to matching only a percentage of the words present in the pattern, only if active on the pattern. |
---|
default | 50 |
---|
name | partialMatchPercent |
---|
type | integer |
---|
|
Saga_config_stagecode |
---|
boundaryFlags | text block split |
---|
requiredFlags | token, semantic tag | all_lower_case |
---|
language | js | skipFlags | stop word |
---|
|
"partialMatchPercent": 50
"dictionary":"saga-provider:saga_bestbets"
"dontProcessFlags":[] |
Example Output
...
|
saga_graph |
V-------[Welcome to Accenture.]--------V
^-----[Welcome to Accenture]------V-[]-^
^-[Welcome]-V-[to]-V-[Accenture]--^
^-[welcome]-^ ^-[accenture]--^
^-[{bestbets}]-^ |
...
- BESTBET - Identifies that the token as a possible reference to a subject to which Saga has a link for.
- SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
- MISSPELL - Identifies tokens with errors or misspells.
Vertex Flags:
Info |
---|
No vertices are created in this stage |
Resource Data
The pattern database is a series of JSON records, typically indexed by "pattern block ID". Each JSON record represents a block of patterns (one or more) that all produce the same semantic tag. The format is as follows:
Saga_jsoncode |
---|
Title | Entity Json Format |
---|
language | js |
---|
|
"usePartialMatch": true,
"patterns": "something1, something2, somnething3",
"description": "Description of the bestbets",
"tag": "search-bet",
"title": "the best bet title",
"url": "http://accenture.enterpricesearch.org",
"confAdjust": 1
. . . additional fields as needed go here . . . |
...