You are viewing an old version of this page. View the current version.
Compare with Current
View Page History
Version 1
Next »
This recognizer works in a similar way to the Entity Recognizer in the sense that looks up sequences of tokens in a dictionary to match the text being processed. The difference is that it will also include in the matching text N tokens to the right and/or left of the original matched text.
Configuration
![](/download/attachments/808388919/122%20Token%20Matcher%20Config.png?version=1&modificationDate=1608757868624&api=v2)
Group tokens in its own tag ( type=boolean
| default=false
| optional
)
- When true, the recognizer will create internal tags to tag the left part, the matched text and the right part
Matched tokens tagname ( type=string
| default=_matchedTokens_
| optional
)
- Internal Tag name for the matched text portion
Left tokens tag name ( type=string
| default=_leftTokens_
| optional
)
- Internal Tag name for the tokens at the left of the matched text
Right tokens tag name ( type=string
| default=_rightTokens_
| optional
)
- Internal Tag name for the tokens at the right of the matched text
Adding a Pattern
Click on the
button to open the "Add new Pattern" dialog
-
Field ( type=string
| required
)
- explanation
- Options
-
Field ( type=boolean
| required
)
- explain
-
Confidence Adjustment ( type=double
| default=1
| required
)
- Adjustment factor to apply to the confidence value of 0.0 to 2.0 from (Applies for every pattern match).
- 0.0 to < 1.0 decreases confidence value
- 1.0 confidence value remains the same
- > 1.0 to 2.0 increases confidence value
![](/download/attachments/808388817/pattern-in-row.png?version=1&modificationDate=1559672501000&api=v2)
General Settings
The general settings can be accessed by clicking on
![](/download/thumbnails/808388856/image-2023-8-7_9-7-27.png?version=1&modificationDate=1691420847668&api=v2)
![](/download/attachments/808388856/image-2023-8-7_9-5-29.png?version=1&modificationDate=1691420729124&api=v2)
- Enable - Enable the processor to be use in pipelines.
- Base Pipeline - Indicates the last stage, from a pipeline, needed by the recognizer.
- Skip Flags ( optional ) - Lexical items flags to be ignored by this processor.
- Boundary Flags ( optional ) - List of vertex flags that indicate the beginning and end of a text block.
- Required Flags ( optional ) - Lexical items flags required by every token to be processed.
- At Least One Flag ( optional ) - Lexical items flags needed by every token to be processed.
- Don't Process Flags ( optional ) - List of lexical items flags that are not processed. The difference with "Skip Flags" is that this will drop the path in the Saga graph, skip just skips the token and continues in the same path.
- Confidence Adjustment - Adjustment factor to apply to the confidence value of 0.0 to 2.0 from (Applies for every match).
- 0.0 to < 1.0 decreases confidence value
- 1.0 confidence value remains the same
- > 1.0 to 2.0 increases confidence value
- Debug - Enable debug logging.