...
These stages are contained inside the Saga Core library and are available
...
at all
...
times.
Text Block Readers
Readers read text streams and create text blocks to process.
- Simple Reader -
Excerpt Include |
---|
| Simple Reader Stage |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
Text Block Breakers
Breakers read text blocks and break them into individual text blocks.
...
- Quotation Breaker -
Excerpt Include |
---|
| Quotation Breaker Stage |
---|
| Quotation Breaker Stage |
---|
nopanel | true |
---|
|
- Text Breaker Stage -
Excerpt Include |
---|
| Text Breaker Stage |
---|
| Text Breaker Stage |
---|
nopanel | true |
---|
|
Tokenizers
Tokenizers read text blocks and divide them up into individual tokens to be processed.
...
- Whitespace Tokenizer -
Excerpt Include |
---|
| Whitespace Tokenizer Stage |
---|
| Whitespace Tokenizer Stage |
---|
nopanel | true |
---|
|
Splitters
Splitters split up tokens into multiple smaller tokens as an alternative interpretation.
...
CharacterSplitter - Tokens are split when any in a specified set of characters (typically punctuation) is encountered.
Character Splitter -
Excerpt Include |
---|
| Character Splitter Stage |
---|
| Character Splitter Stage |
---|
nopanel | true |
---|
|
- Char Change Splitter -
Excerpt Include |
---|
| Character Change Splitter Stage |
---|
| Character Change Splitter Stage |
---|
nopanel | true |
---|
|
Collapsers
Collapsers reduce tokens into simpler smaller tokens as an alternative interpretation.
Character Collapser -
Excerpt Include |
---|
| Character Collapser Stage |
---|
| Character Collapser Stage |
---|
nopanel | true |
---|
|
...
Normalizers
Normalizers create alternative normalized interpretations of tokens from original tokens.
...
- Case Analysis -
Excerpt Include |
---|
| Case Analysis Stage |
---|
| Case Analysis Stage |
---|
nopanel | true |
---|
|
- Synonym -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- Remove Accents -
Excerpt Include |
---|
| Remove Accents Stage |
---|
| Remove Accents Stage |
---|
nopanel | true |
---|
|
Recognizers
Recognizers identify and flag tokens based on their character patterns.
...
- Number Recognizer -
Excerpt Include |
---|
| Number Recognizer Stage |
---|
| Number Recognizer Stage |
---|
nopanel | true |
---|
|
- Stop Words -
Excerpt Include |
---|
| Stop Words Stage |
---|
| Stop Words Stage |
---|
nopanel | true |
---|
|
- Lemmatize -
Excerpt Include |
---|
| Lemmatize Stage |
---|
| Lemmatize Stage |
---|
nopanel | true |
---|
|
- Synonym Stage -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- ABA Recognizer -
Excerpt Include |
---|
| ABA Recognizer Stage |
---|
| ABA Recognizer Stage |
---|
nopanel | true |
---|
|
- BIC Recognizer -
Excerpt Include |
---|
| BIC Recognizer Stage |
---|
| BIC Recognizer Stage |
---|
nopanel | true |
---|
|
- IBAN Recognizer -
Excerpt Include |
---|
| IBAN Recognizer Stage |
---|
| IBAN Recognizer Stage |
---|
nopanel | true |
---|
|
- Date Time Recognizer -
Excerpt Include |
---|
| Date Time Recognizer Stage |
---|
| Date Time Recognizer Stage |
---|
nopanel | true |
---|
|
- Email Recognizer -
Excerpt Include |
---|
| Email Recognizer Stage |
---|
| Email Recognizer Stage |
---|
nopanel | true |
---|
|
- Phone Number -
Excerpt Include |
---|
| Phone Number Recognizer Stage |
---|
| Phone Number Recognizer Stage |
---|
nopanel | true |
---|
|
- Postal Code -
Excerpt Include |
---|
| Postal Code Recognizer Stage |
---|
| Postal Code Recognizer Stage |
---|
nopanel | true |
---|
|
- URL Recognizer -
Excerpt Include |
---|
| URL Recognizer Stage |
---|
| URL Recognizer Stage |
---|
nopanel | true |
---|
|
- Federal Recognizer -
Excerpt Include |
---|
| Federal Recognizer Stage |
---|
| Federal Recognizer Stage |
---|
nopanel | true |
---|
|
- IP Address Recognizer -
Excerpt Include |
---|
| IP Address Recognizer Stage |
---|
| IP Address Recognizer Stage |
---|
nopanel | true |
---|
|
- Latitude Longitude Recognizer -
Excerpt Include |
---|
| Latitude Longitude Recognizer Stage |
---|
| Latitude Longitude Recognizer Stage |
---|
nopanel | true |
---|
|
- MAC Address Recognizer -
Excerpt Include |
---|
| MAC Address Recognizer Stage |
---|
| MAC Address Recognizer Stage |
---|
nopanel | true |
---|
|
- MAID Recognizer -
Excerpt Include |
---|
| MAID Recognizer Stage |
---|
| MAID Recognizer Stage |
---|
nopanel | true |
---|
|
- Credit Card Recognizer -
Excerpt Include |
---|
| Credit Card Recognizer Stage |
---|
| Credit Card Recognizer Stage |
---|
nopanel | true |
---|
|
Taggers
Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.
...
- Regex Pattern -
Excerpt Include |
---|
| Regex Pattern Stage |
---|
| Regex Pattern Stage |
---|
nopanel | true |
---|
|
- Simple Regex -
Excerpt Include |
---|
| Simple Reader Stage |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
- Dictionary Tagger -
Excerpt Include |
---|
| Dictionary Tagger Stage |
---|
| Dictionary Tagger Stage |
---|
nopanel | true |
---|
|
- Advanced Pattern -
Excerpt Include |
---|
| Advanced Pattern Stage |
---|
| Advanced Pattern Stage |
---|
nopanel | true |
---|
|
- Fragmentation -
Excerpt Include |
---|
| Fragmentation Stage |
---|
| Fragmentation Stage |
---|
nopanel | true |
---|
|
- GeoNames -
Excerpt Include |
---|
| GeoNames Stage |
---|
| GeoNames Stage |
---|
nopanel | true |
---|
|
- Token Matcher -
Excerpt Include |
---|
| Token Matcher Recognizer Stage |
---|
| Token Matcher Recognizer Stage |
---|
nopanel | true |
---|
|
Transformers
Transformers generates tags, not of semantic nature, but with new data for later use
- Bag Of Words-
Excerpt Include |
---|
| Bag Of Words Stage |
---|
| Bag Of Words Stage |
---|
nopanel | true |
---|
|
- Best Bets -
Excerpt Include |
---|
| Best Bets Stage |
---|
| Best Bets Stage |
---|
nopanel | true |
---|
|
- NGram -
Excerpt Include |
---|
| NGram Stage |
---|
| NGram Stage |
---|
nopanel | true |
---|
|
Python
- Python Model Recognizer -
Excerpt Include |
---|
| Python Model Recognizer Stage |
---|
| Python Model Recognizer Stage |
---|
nopanel | true |
---|
|
- Python Model -
Excerpt Include |
---|
| Python Model Stage |
---|
| Python Model Stage |
---|
nopanel | true |
---|
|
Producers
Producers create consumable output based on the processed graph.
- Json Producer -
Excerpt Include |
---|
| JSON Producer Stage |
---|
| JSON Producer Stage |
---|
nopanel | true |
---|
|
- Markup Producer -
Excerpt Include |
---|
| Markup Producer Stage |
---|
| Markup Producer Stage |
---|
nopanel | true |
---|
|
Filters
Mark vertices or tokens to be skipped by other stages.
...
...
...
| Sentence Filter Stage |
---|
nopanel | true |
---|
|