Saga 1.3.4
Character Splitter Stage
Versions Compared
Old Version 15, changed by Potter (Esteban Alvarado), saved on Jul 20, 2018
compared with
New Version 16, changed by Potter (Esteban Alvarado), saved on May 31, 2019
Include Page: Generic Configuration Parameters
Configuration Parameters
splitChars (string, optional) - List of characters which should be used to split tokens. If not present, then tokens are split on any sequence of punctuation.
dontSplitChars (string, optional) - List of characters which will NOT be used to split tokens. This is typically used to identify exceptions (characters which are not used to split tokens) when splitChars is missing. These characters are included in the produced tokens.
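The interaction between splitChars and dontSplitChars can be illustrated with a minimal Python sketch. This is not Saga's implementation: the function name `split_token` is hypothetical, "punctuation" is assumed to mean ASCII punctuation, and the sketch assumes that the characters used for splitting are dropped from the resulting tokens.

```python
import string

def split_token(token, split_chars=None, dont_split_chars=""):
    """Illustrative only: mimic the documented splitChars / dontSplitChars behavior."""
    # Assumption: with no splitChars, any ASCII punctuation character is a separator.
    separators = set(string.punctuation) if split_chars is None else set(split_chars)
    # Characters listed in dontSplitChars never split and stay inside the tokens.
    separators -= set(dont_split_chars)

    pieces, current = [], []
    for ch in token:
        if ch in separators:
            # A run of separators produces a single split (no empty tokens).
            if current:
                pieces.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        pieces.append("".join(current))
    return pieces

print(split_token("ab-cd_ef"))                        # splits on "-" and "_"
print(split_token("ab-cd_ef", dont_split_chars="_"))  # "_" is kept inside the token
print(split_token("a.b-c", split_chars="."))          # only "." splits
```

With dont_split_chars="_" the underscore is no longer a separator, so "cd_ef" stays whole.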
splitBeforeChars - If any character in this list occurs inside a token, that token will be split just before that character.
splitAfterChars - If any character in this list occurs inside a token, that token will be split just after that character.
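The difference between splitting before and splitting after a character can be sketched as follows. This is an illustration only, not Saga's code: the function name `split_around` is hypothetical, and the sketch assumes the split character itself is kept in the output (at the start of the next piece for splitBeforeChars, at the end of the current piece for splitAfterChars).

```python
def split_around(token, before_chars="", after_chars=""):
    """Illustrative only: split just before / just after the listed characters."""
    pieces, current = [], ""
    for ch in token:
        # Split just before the character: close the piece built so far.
        if ch in before_chars and current:
            pieces.append(current)
            current = ""
        current += ch
        # Split just after the character: close the piece including it.
        if ch in after_chars:
            pieces.append(current)
            current = ""
    if current:
        pieces.append(current)
    return pieces

print(split_around("price$100", before_chars="$"))  # "$" starts the second piece
print(split_around("50%off", after_chars="%"))      # "%" ends the first piece
```

A character listed in both parameters ends up as a piece of its own, because the token is cut on both sides of it.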
splitPrefixChars - true/false, whether to split on all punctuation (default: true).
splitSuffixChars - true/false, whether to split on all punctuation (default: true).
splitFlag (string, optional) - The flag to be put on the vertex between the two tokens. If missing, defaults to ALL_PUNCTUATION.