Excerpt |
---|
This stage uses Apache Lucene™ to create a custom Lucene pipeline. It offers a large number of tokenizers and filters that can be combined to adapt to the user's needs. |
Operates On: Lexical Items with TOKEN and possibly other flags as specified below.
Saga_is_recognizer
Info |
---|
A Lucene Custom Analyzer is composed of two components: the Tokenizer and the Filters (which can be stacked to use more than one at a time). |
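The tokenizer-plus-stacked-filters structure described above can be sketched in plain Python. This is only an illustration of the idea, not the Saga or Lucene API: `whitespace_tokenizer`, `lowercase_filter`, `min_length_filter`, and `analyze` are hypothetical names invented for this example.

```python
from typing import Callable, Iterable, List

# Hypothetical stand-ins for a Lucene tokenizer and token filters.
# These names are illustrative only; they are not Saga or Lucene APIs.
Tokenizer = Callable[[str], List[str]]
TokenFilter = Callable[[Iterable[str]], Iterable[str]]


def whitespace_tokenizer(text: str) -> List[str]:
    """Split on runs of whitespace, leaving punctuation attached to tokens."""
    return text.split()


def lowercase_filter(tokens: Iterable[str]) -> Iterable[str]:
    """Lowercase every token."""
    return (t.lower() for t in tokens)


def min_length_filter(tokens: Iterable[str]) -> Iterable[str]:
    """Drop tokens shorter than two characters."""
    return (t for t in tokens if len(t) >= 2)


def analyze(text: str, tokenizer: Tokenizer, filters: List[TokenFilter]) -> List[str]:
    """Run one tokenizer, then apply each filter in order (the 'stack')."""
    tokens: Iterable[str] = tokenizer(text)
    for token_filter in filters:
        tokens = token_filter(tokens)
    return list(tokens)


result = analyze(
    "Hey there! I am",
    whitespace_tokenizer,
    [lowercase_filter, min_length_filter],
)
print(result)  # ['hey', 'there!', 'am']
```

The order of the filter list matters, since each filter only sees the output of the one before it.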
Saga_config_stagecode | ||||
---|---|---|---|---|
| ||||
"atLeastOneFlag": [], "boundaryFlags": [], "confidenceAdjustment": 1, "debug": false, "requiredFlags": [], "skipFlags": [], "type": "LucenePipelineStage", "tokenizer": "whitespace", "filter": None |
Using Whitespace Tokenizer alone
Code Block | ||
---|---|---|
| ||
V-------------[Hey there! I am using Lucene Pipeline]-------------V ^-[Hey]-V-[there!]-V-[I]-V-[am]-V-[using]-V-[Lucene]-V-[Pipeline]-^ |
Info |
---|
No vertices are created in this stage |
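As a rough check of the whitespace-only example above, the same split can be reproduced in plain Python. This sketch assumes the tokenizer simply breaks on runs of whitespace and keeps punctuation attached. The helper name `whitespace_tokens` is hypothetical, and the character offsets are an addition for illustration, not something the stage is documented to output.

```python
import re
from typing import List, Tuple


def whitespace_tokens(text: str) -> List[Tuple[str, int, int]]:
    """Return (token, start, end) for each run of non-whitespace characters.

    Hypothetical helper: it mimics whitespace tokenization only and
    does not call Lucene or Saga.
    """
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


sentence = "Hey there! I am using Lucene Pipeline"
tokens = whitespace_tokens(sentence)

# Print only the token text, in order.
print([tok for tok, _, _ in tokens])
# ['Hey', 'there!', 'I', 'am', 'using', 'Lucene', 'Pipeline']
```

Note that `there!` keeps its exclamation mark, matching the graph above. Removing punctuation would be the job of a filter or a different tokenizer.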