This These stages are contained inside the Saga Core library and are available in at all timetimes.
Text Block Readers
Readers read text streams and create text blocks to process.
- SimpleReaderSimple Reader -
Excerpt Include |
---|
| SimpleReader Simple Reader StageSimpleReader |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
Text Block Breakers
Breakers read text blocks and breaks break them into individual text blocks.
- QuotationBreaker Quotation Breaker -
Excerpt Include |
---|
| Quotation Breaker Stage |
---|
| Quotation Breaker Stage |
---|
nopanel | true |
---|
|
- Text Breaker Stage -
Excerpt Include |
---|
| Text Breaker Stage |
---|
| Text Breaker Stage |
---|
nopanel | true |
---|
|
Tokenizers
Tokenizers read text blocks and divide them up into individual tokens to be processed.
Splitters
Splitters split up tokens into multiple smaller tokens as an alternative interpretation.
CharacterSplitterCharacter Splitter -
Excerpt Include |
---|
CharacterSplitter Stage | CharacterSplitter |
---|
| Character Splitter Stage |
---|
| Character Splitter Stage |
---|
nopanel | true |
---|
|
- Char Change Splitter -
Excerpt Include |
---|
| Character Change Splitter Stage |
---|
| Character Change Splitter Stage |
---|
nopanel | true |
---|
|
Collapsers
Collapsers reduce tokens into simpler smaller tokens as an alternative interpretation.
Character Collapser
CharChangeSplitter -
CharChangeSplitter | Character Collapser Stage |
---|
|
CharChangeSplitter | Character Collapser Stage |
---|
nopanel | true |
---|
|
Normalizers
Normalizers create alternative normalized interpretations of tokens from original tokens.
- CaseAnalysisCase Analysis -
Excerpt Include |
---|
| Case Analysis Stage |
---|
| Case Analysis Stage |
---|
nopanel | true |
---|
|
- Synonym -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- Remove Accents -
Excerpt Include |
---|
| Remove Accents Stage |
---|
| Remove Accents | CaseAnalysis Stage | CaseAnalysis Stage |
---|
nopanel | true |
---|
|
Recognizers
Recognizers identify and flag tokens based on their character patterns.
...
...
...
| Number Recognizer Stage |
---|
nopanel | true |
---|
|
...
...
...
| Stop Words Stage |
---|
nopanel | true |
---|
|
- Lemmatize -
Excerpt Include |
---|
| Lemmatize Stage |
---|
| Lemmatize Stage |
---|
nopanel | true |
---|
|
- Synonym Stage -
Excerpt Include |
---|
| Synonym Stage |
---|
| Synonym Stage |
---|
nopanel | true |
---|
|
- ABA Recognizer -
Excerpt Include |
---|
| ABA Recognizer Stage |
---|
| ABA Recognizer Stage |
---|
nopanel | true |
---|
|
- BIC Recognizer -
Excerpt Include |
---|
| BIC Recognizer Stage |
---|
| BIC Recognizer Stage |
---|
nopanel | true |
---|
|
- IBAN Recognizer -
Excerpt Include |
---|
| IBAN Recognizer Stage |
---|
| IBAN Recognizer Stage |
---|
nopanel | true |
---|
|
- Date Time Recognizer -
Excerpt Include |
---|
| Date Time Recognizer Stage |
---|
| Date Time Recognizer Stage |
---|
nopanel | true |
---|
|
- Email Recognizer -
Excerpt Include |
---|
| Email Recognizer Stage |
---|
| Email Recognizer Stage |
---|
nopanel | true |
---|
|
- Phone Number -
Excerpt Include |
---|
| Phone Number Recognizer Stage |
---|
| Phone Number Recognizer Stage |
---|
nopanel | true |
---|
|
- Postal Code -
Excerpt Include |
---|
| Postal Code Recognizer Stage |
---|
| Postal Code Recognizer Stage |
---|
nopanel | true |
---|
|
- URL Recognizer -
Excerpt Include |
---|
| URL Recognizer Stage |
---|
| URL Recognizer Stage |
---|
nopanel | true |
---|
|
- Federal Recognizer -
Excerpt Include |
---|
| Federal Recognizer Stage |
---|
| Federal Recognizer Stage |
---|
nopanel | true |
---|
|
- IP Address Recognizer -
Excerpt Include |
---|
| IP Address Recognizer Stage |
---|
| IP Address Recognizer Stage |
---|
nopanel | true |
---|
|
- Latitude Longitude Recognizer -
Excerpt Include |
---|
| Latitude Longitude Recognizer Stage |
---|
| Latitude Longitude Recognizer Stage |
---|
nopanel | true |
---|
|
- MAC Address Recognizer -
Excerpt Include |
---|
| MAC Address Recognizer Stage |
---|
| MAC Address Recognizer Stage |
---|
nopanel | true |
---|
|
- MAID Recognizer -
Excerpt Include |
---|
| MAID Recognizer Stage |
---|
| MAID Recognizer Stage |
---|
nopanel | true |
---|
|
- Credit Card Recognizer -
Excerpt Include |
---|
| Credit Card Recognizer Stage |
---|
| Credit Card Recognizer Stage |
---|
nopanel | true |
---|
|
Taggers
Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.
- RegexPattern Regex Pattern -
Excerpt Include |
---|
| Regex Pattern Stage |
---|
| Regex Pattern Stage |
---|
nopanel | true |
---|
|
- Simple Regex -
Excerpt Include |
---|
| Simple Reader Stage |
---|
| Simple Reader Stage |
---|
nopanel | true |
---|
|
- Dictionary Tagger -
Excerpt Include |
---|
| Dictionary Tagger Stage |
---|
| Dictionary Tagger Stage |
---|
nopanel | true |
---|
|
- Advanced Pattern -
Excerpt Include |
---|
| Advanced Pattern Stage |
---|
| Advanced Pattern Stage |
---|
nopanel | true |
---|
|
- Fragmentation -
Excerpt Include |
---|
| Fragmentation Stage |
---|
| Fragmentation Stage |
---|
nopanel | true |
---|
|
- GeoNames -
Excerpt Include |
---|
| GeoNames Stage |
---|
| GeoNames Stage |
---|
nopanel | true |
---|
|
- Token Matcher -
Excerpt Include |
---|
| Token Matcher Recognizer Stage |
---|
| Token Matcher Recognizer Stage |
---|
nopanel | true |
---|
|
Transformers
Transformers generates tags, not of semantic nature, but with new data for later use
- Bag Of Words-
Excerpt Include |
---|
| Bag Of Words Stage |
---|
| Bag Of Words Stage |
---|
nopanel | true |
---|
|
- Best Bets -
Excerpt Include |
---|
| Best Bets Stage |
---|
| Best Bets Stage |
---|
nopanel | true |
---|
|
- NGram -
Excerpt Include |
---|
| NGram Stage |
---|
| NGram Stage |
---|
nopanel | true |
---|
|
Python
- Python Classification Watcher Stage -
Excerpt Include |
---|
| Python Model Recognizer Stage |
---|
| Python Model Recognizer Stage |
---|
nopanel | true |
---|
|
- Python Model Recognizer -
Excerpt Include |
---|
| Python Model Recognizer Stage |
---|
| Python Model Recognizer Stage |
---|
nopanel | true |
---|
|
- Python Model -
Excerpt Include |
---|
| Python Model Stage |
---|
| Python Model Stage |
---|
nopanel | true |
---|
|
Producers
Producers create consumable output based on the processed graph.
- Json Producer -
Excerpt Include |
---|
| JSON Producer Stage |
---|
| JSON Producer Stage |
---|
nopanel | true |
---|
|
- Markup Producer -
Excerpt Include |
---|
| Markup Producer Stage |
---|
| Markup Producer Stage |
---|
nopanel | true |
---|
|
Filters
Mark vertices or tokens to be skipped by other stages.
- Sentence Filter ProducerDictionaryTagger - DictionaryTagger DictionaryTagger
| Sentence Filter Stage |
---|
nopanel | true |
---|
|
- AdvancedPattern Length Filter Stage - AdvancedPattern AdvancedPattern
| Length Filter Stage |
---|
nopanel | true |
---|
|