Character Splitter Stage
Versions compared: version 16 and version 17, both saved by Potter (Esteban Alvarado) on May 31, 2019.
...
Parameter: splitChars
Summary: List of characters that should be used to split tokens.
If not present, tokens are split on any sequence of punctuation.
Parameter: dontSplitChars
Summary: List of characters that will NOT be used to split tokens.
This is typically used to declare exceptions (punctuation characters that should not split tokens) when splitChars is missing.
These characters are included in the produced tokens.
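The interaction between splitChars and dontSplitChars can be sketched in a few lines of Python. This is only an illustration of the behavior described above, not Saga's implementation: the function name `split_token` and its arguments are hypothetical, punctuation is approximated with Python's `string.punctuation`, and the sketch assumes that separator characters are dropped from the output.

```python
import string


def split_token(token, split_chars=None, dont_split_chars=""):
    """Illustrative splitter (hypothetical helper, not part of Saga)."""
    # With no explicit split characters, fall back to "any punctuation".
    if split_chars is not None:
        separators = set(split_chars)
    else:
        separators = set(string.punctuation)
    # Exceptions are never treated as separators, so they stay in the tokens.
    separators -= set(dont_split_chars)

    pieces, current = [], []
    for ch in token:
        if ch in separators:
            # A run of separators produces a single split (assumption:
            # the separators themselves are discarded).
            if current:
                pieces.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        pieces.append("".join(current))
    return pieces
```

For example, under these assumptions `split_token("e-mail_address")` splits on both the hyphen and the underscore, while passing `dont_split_chars="-"` keeps `e-mail` together as one token.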
Parameter: splitBeforeChars
Summary: If any character in this list occurs inside a token, that token will be split just before that character.
Parameter: splitAfterChars
Summary: If any character in this list occurs inside a token, that token will be split just after that character.
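The difference between splitting before and splitting after a character is easiest to see on a concrete token. The sketch below is a hypothetical Python illustration of the two descriptions above, not Saga code; the function name `split_before_after` and its arguments are made up for the example, and it assumes the triggering character is kept in the output (at the start of the next piece for splitBeforeChars, at the end of the current piece for splitAfterChars).

```python
def split_before_after(token, split_before_chars="", split_after_chars=""):
    """Illustrative splitter (hypothetical helper, not part of Saga)."""
    pieces, current = [], ""
    for ch in token:
        # Split just before the character: close the piece collected so far.
        if ch in split_before_chars and current:
            pieces.append(current)
            current = ""
        current += ch
        # Split just after the character: close the piece including it.
        if ch in split_after_chars:
            pieces.append(current)
            current = ""
    if current:
        pieces.append(current)
    return pieces
```

With `split_before_chars="$"`, the token `US$100` becomes `US` and `$100`; with `split_after_chars="%"`, the token `100%off` becomes `100%` and `off`.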
Parameter: splitPrefixChars
Summary: true/false; whether to split on all punctuation (default: true).
Parameter: splitSuffixChars
Summary: true/false; whether to split on all punctuation (default: true).
Parameter: splitFlag
Summary: The flag to be put on the vertex between the two tokens.
If missing, defaults to ALL_PUNCTUATION.
...