This These stages are contained inside the Saga Core library and are available in at all timetimes.
Text Block Readers
Readers read text streams and create text blocks to process.
- Simple Reader -
Excerpt Include |
---|
| Simple Reader Stage |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
Text Block Breakers
Breakers read text blocks and break them into individual text blocks.
- Quotation Breaker -
Excerpt Include |
---|
| Quotation Breaker Stage |
---|
| Quotation Breaker Stage |
---|
nopanel | true |
---|
|
- Text Breaker Stage -
Excerpt Include |
---|
| Text Breaker Stage |
---|
| Text Breaker Stage |
---|
nopanel | true |
---|
|
SimpleReader
Tokenizers
Tokenizers read text blocks and divide them up into individual tokens to be processed.
Splitters
Splitters split up tokens into multiple smaller tokens as an alternative interpretation.
Character Splitter -
Excerpt Include |
---|
| Character Splitter Stage |
---|
| Character Splitter Stage |
---|
nopanel | true |
---|
|
- Char Change Splitter -
Excerpt Include |
---|
| Character Change Splitter Stage |
---|
| Character Change Splitter Stage |
---|
nopanel | true |
---|
|
Collapsers
Collapsers reduce tokens into simpler smaller tokens as an alternative interpretation.
Character Collapser -
Excerpt Include |
---|
| Character Collapser Stage |
---|
| Character Collapser Stage |
---|
nopanel | true |
---|
|
CharacterSplitter - Tokens are split when any in a specified set of characters (typically punctuation) is encountered.
- CharChangeSplitter - Tokens are split when any difference between caharaters is encountered.
Normalizers
Normalizers create alternative normalized interpretations of tokens from original tokens.
- Case Analysis -
Excerpt Include |
---|
| Case Analysis Stage |
---|
| Case Analysis Stage |
---|
nopanel | true |
---|
|
- Synonym -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- Remove Accents -
Excerpt Include |
---|
| Remove Accents Stage |
---|
| Remove Accents Stage |
---|
nopanel | true |
---|
|
CaseAnalysis - Analyzes and flags the case of tokens and then (optionally) normalizes the token to lower case.
Recognizers
Recognizers identify and flag tokens based on their character patterns.
...
- Number Recognizer -
Excerpt Include |
---|
| Number Recognizer Stage |
---|
| Number Recognizer Stage |
---|
nopanel | true |
---|
|
- Stop Words -
Excerpt Include |
---|
| Stop Words Stage |
---|
| Stop Words Stage |
---|
nopanel | true |
---|
|
- Lemmatize -
Excerpt Include |
---|
| Lemmatize Stage |
---|
| Lemmatize Stage |
---|
nopanel | true |
---|
|
- Synonym Stage -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- ABA Recognizer -
Excerpt Include |
---|
| ABA Recognizer Stage |
---|
| ABA Recognizer Stage |
---|
nopanel | true |
---|
|
- BIC Recognizer -
Excerpt Include |
---|
| BIC Recognizer Stage |
---|
| BIC Recognizer Stage |
---|
nopanel | true |
---|
|
- IBAN Recognizer -
Excerpt Include |
---|
| IBAN Recognizer Stage |
---|
| IBAN Recognizer Stage |
---|
nopanel | true |
---|
|
- Date Time Recognizer -
Excerpt Include |
---|
| Date Time Recognizer Stage |
---|
| Date Time Recognizer Stage |
---|
nopanel | true |
---|
|
- Email Recognizer -
Excerpt Include |
---|
| Email Recognizer Stage |
---|
| Email Recognizer Stage |
---|
nopanel | true |
---|
|
- Phone Number -
Excerpt Include |
---|
| Phone Number Recognizer Stage |
---|
| Phone Number Recognizer Stage |
---|
nopanel | true |
---|
|
- Postal Code -
Excerpt Include |
---|
| Postal Code Recognizer Stage |
---|
| Postal Code Recognizer Stage |
---|
nopanel | true |
---|
|
- URL Recognizer -
Excerpt Include |
---|
| URL Recognizer Stage |
---|
| URL Recognizer Stage |
---|
nopanel | true |
---|
|
- Federal Recognizer -
Excerpt Include |
---|
| Federal Recognizer Stage |
---|
| Federal Recognizer Stage |
---|
nopanel | true |
---|
|
- IP Address Recognizer -
Excerpt Include |
---|
| IP Address Recognizer Stage |
---|
| IP Address Recognizer Stage |
---|
nopanel | true |
---|
|
- Latitude Longitude Recognizer -
Excerpt Include |
---|
| Latitude Longitude Recognizer Stage |
---|
| Latitude Longitude Recognizer Stage |
---|
nopanel | true |
---|
|
- MAC Address Recognizer -
Excerpt Include |
---|
| MAC Address Recognizer Stage |
---|
| MAC Address Recognizer Stage |
---|
nopanel | true |
---|
|
- MAID Recognizer -
Excerpt Include |
---|
| MAID Recognizer Stage |
---|
| MAID Recognizer Stage |
---|
nopanel | true |
---|
|
- Credit Card Recognizer -
Excerpt Include |
---|
| Credit Card Recognizer Stage |
---|
| Credit Card Recognizer Stage |
---|
nopanel | true |
---|
|
Taggers
Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.
- Regex Pattern -
Excerpt Include |
---|
| Regex Pattern Stage |
---|
| Regex Pattern Stage |
---|
nopanel | true |
---|
|
- Simple Regex -
Excerpt Include |
---|
| Simple Reader Stage |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
- Dictionary Tagger -
Excerpt Include |
---|
| Dictionary Tagger Stage |
---|
| Dictionary Tagger Stage |
---|
nopanel | true |
---|
|
- Advanced Pattern -
Excerpt Include |
---|
| Advanced Pattern Stage |
---|
| Advanced Pattern Stage |
---|
nopanel | true |
---|
|
- Fragmentation -
Excerpt Include |
---|
| Fragmentation Stage |
---|
| Fragmentation Stage |
---|
nopanel | true |
---|
|
- GeoNames -
Excerpt Include |
---|
| GeoNames Stage |
---|
| GeoNames Stage |
---|
nopanel | true |
---|
|
- Token Matcher -
Excerpt Include |
---|
| Token Matcher Recognizer Stage |
---|
| Token Matcher Recognizer Stage |
---|
nopanel | true |
---|
|
Transformers
Transformers generates tags, not of semantic nature, but with new data for later use
- Bag Of Words-
Excerpt Include |
---|
| Bag Of Words Stage |
---|
| Bag Of Words Stage |
---|
nopanel | true |
---|
|
- Best Bets -
Excerpt Include |
---|
| Best Bets Stage |
---|
| Best Bets Stage |
---|
nopanel | true |
---|
|
- NGram -
Excerpt Include |
---|
| NGram Stage |
---|
| NGram Stage |
---|
nopanel | true |
---|
|
Python
- Python Model Recognizer -
Excerpt Include |
---|
| Python Model Recognizer Stage |
---|
| Python Model Recognizer Stage |
---|
nopanel | true |
---|
|
- Python Model -
Excerpt Include |
---|
| Python Model Stage |
---|
| Python Model Stage |
---|
nopanel | true |
---|
|
Producers
Producers create consumable output based on the processed graph.
- Json Producer -
Excerpt Include |
---|
| JSON Producer Stage |
---|
| JSON Producer Stage |
---|
nopanel | true |
---|
|
- Markup Producer -
Excerpt Include |
---|
| Markup Producer Stage |
---|
| Markup Producer Stage |
---|
nopanel | true |
---|
|
Filters
Mark vertices or tokens to be skipped by other stages.
- Sentence Filter Producer
- DictionaryTagger - Looks up all combinations of tokens in a dictionary and tags any that are found.
- AdvancedPattern - AdvancedPattern AdvancedPattern
| Sentence Filter Stage |
---|
nopanel | true |
---|
|