Splits tokens on specified characters, typically punctuation. Multiple split characters in a row will create a single split (not multiple splits).
Operates On: Lexical Items with TOKEN
{ "type":"CharacterSplitter", "dontSplitChars":"." }
Splits on all punctuation, except periods.
For example, the token: "SagaToolkit-1.0" will produce the following graph:
V-------[SagaToolkit-1.0]-------V ^---[SagaToolkit]--V--[1.0]----^
{ "type":"CharacterSplitter", "splitChars":"-", "splitFlag":"DASH_SPLIT" }
(splits tokens dashes)
V-----[Abe-Lincoln]-----V--[likes]--V--[the]--V-----[iPhone-*&@#*&7.0]-----V ^--[Abe]--V--[Lincoln]--^ ^--[iPhone]--V--[7]--V--[0]--^
{ "type":"CharacterSplitter", "dontSplitChars": "." }
V-----[Abe-Lincoln]-----V--[likes]--V--[the]--V--[iPhone-*&@#*&7.0]--V ^--[Abe]--V--[Lincoln]--^ ^--[iPhone]--V--[7.0]--^
{ "type":"CharacterSplitter", "splitChars": "-#." "dontSplitChars": "." }
V-----[Abe-Lincoln]-----V--[likes]--V--[the]--V--------[iPhone-*&@#*&7.0]--------V ^--[Abe]--V--[Lincoln]--^ ^--[iPhone]--V--[*&@]--V--[*&7.0]--^