...
Readers read text streams and create text blocks to process.
...
Breakers read text blocks and break them into smaller, individual text blocks.
...
Tokenizers read text blocks and divide them up into individual tokens to be processed.
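As a rough illustration of what a tokenizer stage does, the sketch below splits a text block on whitespace. The function name `whitespace_tokenize` is hypothetical and is not taken from this product's API; a real tokenizer stage would also attach the tokens to the interpretation graph.

```python
def whitespace_tokenize(text_block: str) -> list[str]:
    """Divide a text block into tokens on runs of whitespace (illustrative only)."""
    # str.split() with no arguments collapses consecutive whitespace
    # and ignores leading/trailing whitespace.
    return text_block.split()


tokens = whitespace_tokenize("Order AB123 shipped  today")
print(tokens)  # ['Order', 'AB123', 'shipped', 'today']
```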
...
Splitters split up tokens into multiple smaller tokens as an alternative interpretation.
- Character Splitter
- Char Change Splitter
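The two splitter ideas can be sketched as follows. This is a minimal illustration under assumptions, not the stages' actual implementation: `character_split` splits on a hypothetical set of punctuation characters, and `char_change_split` splits wherever the character class (letter, digit, other) changes. In both cases the original token would be kept, with the smaller tokens offered as an alternative interpretation.

```python
import re
from itertools import groupby


def character_split(token: str, split_chars: str = "-_/") -> list[str]:
    """Split a token on any of the given characters (assumed defaults)."""
    parts = re.split("[" + re.escape(split_chars) + "]", token)
    return [p for p in parts if p]  # drop empty pieces


def char_class(ch: str) -> str:
    """Classify a character as letter, digit, or anything else."""
    if ch.isalpha():
        return "alpha"
    if ch.isdigit():
        return "digit"
    return "other"


def char_change_split(token: str) -> list[str]:
    """Split a token at every point where the character class changes."""
    return ["".join(group) for _, group in groupby(token, key=char_class)]


print(character_split("state-of-the-art"))  # ['state', 'of', 'the', 'art']
print(char_change_split("AB123cd"))         # ['AB', '123', 'cd']
```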
Collapsers
Collapsers reduce tokens into simpler, smaller tokens as an alternative interpretation.
- Character Collapser
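One plausible reading of character collapsing is reducing each run of a repeated character to a single character. The sketch below assumes that behavior; the actual Character Collapser Stage may work differently, and `collapse_repeats` is a hypothetical name.

```python
import re


def collapse_repeats(token: str) -> str:
    """Reduce every run of the same character to one occurrence (assumed behavior)."""
    # (.)\1+ matches a character followed by one or more copies of itself.
    return re.sub(r"(.)\1+", r"\1", token)


print(collapse_repeats("soooo"))  # so
print(collapse_repeats("abc"))    # abc
```

Because this also shortens legitimate double letters ("hello" becomes "helo"), the collapsed form only makes sense as an alternative interpretation alongside the original token, not as a replacement for it.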
Normalizers
Normalizers create alternative normalized interpretations of tokens from original tokens.
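A minimal sketch of normalization, assuming the normalized form is the lowercase token with diacritics removed. The choice of rules and the name `normalize_token` are assumptions made for illustration; the actual normalizer stages may apply other rules.

```python
import unicodedata


def normalize_token(token: str) -> str:
    """Return a lowercase, accent-free version of the token (illustrative rules)."""
    # NFKD separates base characters from their combining marks.
    decomposed = unicodedata.normalize("NFKD", token)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


print(normalize_token("Café"))  # cafe
```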
...
Recognizers identify and flag tokens based on their character patterns.
- Number Recognizer
- Stop Words
- Lemmatize
- ABA
- BIC
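To illustrate pattern-based recognition, the sketch below flags tokens that look like numbers and tokens found in a stop-word list. The regular expression, the tiny stop-word set, and the flag names are all hypothetical; the real stages are configured separately and may use different patterns and flags.

```python
import re

# Assumed pattern: optional sign, digits, optional decimal part.
NUMBER_PATTERN = re.compile(r"[+-]?\d+(?:\.\d+)?")

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "the", "of", "and"}


def recognize(token: str) -> set[str]:
    """Return the set of flags that apply to a token."""
    flags = set()
    if NUMBER_PATTERN.fullmatch(token):
        flags.add("NUMBER")
    if token.lower() in STOP_WORDS:
        flags.add("STOP_WORD")
    return flags


for tok in ["3.14", "The", "cat"]:
    print(tok, sorted(recognize(tok)))
```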
...
Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.
...
Transformers generate tags that are not semantic in nature but carry new data for later use.
- Bag Of Words
- Best Bets
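As an example of the kind of non-semantic data a transformer might produce, the sketch below builds a simple bag of words: a count of how often each lowercased token occurs. This is a generic illustration using a hypothetical `bag_of_words` helper, not the Bag Of Words Stage's actual output format.

```python
from collections import Counter


def bag_of_words(tokens: list[str]) -> Counter:
    """Count occurrences of each token, ignoring case."""
    return Counter(token.lower() for token in tokens)


bag = bag_of_words(["the", "cat", "The", "hat"])
print(bag["the"])  # 2
print(bag["cat"])  # 1
```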
...
Producers create consumable output based on the processed graph.
...