Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

These stages are contained inside the Saga Core library and are available at all times.

Text Block Readers

Readers read text streams and create text blocks to process.

  • Simple Reader - 
    Excerpt Include
    Simple Reader Stage
    Simple Reader Stage
    nopaneltrue

Text Block Breakers

Breakers read text blocks and break them into individual text blocks.

Tokenizers

Tokenizers read text blocks and divide them up into individual tokens to be processed.

  • Whitespace Tokenizer - 
    Excerpt Include
    Whitespace Tokenizer Stage
    Whitespace Tokenizer Stage
    nopaneltrue

Splitters

Splitters split up tokens into multiple smaller tokens as an alternative interpretation.

  • Character Splitter - 

    Excerpt Include
    Character Splitter Stage
    Character Splitter Stage
    nopaneltrue

  • Char Change Splitter 
    Excerpt Include
    Character Change Splitter Stage
    Character Change Splitter Stage
    nopaneltrue

Collapsers

Collapsers reduce tokens into simpler smaller tokens as an alternative interpretation.

  • Character Collapser - 

    Excerpt Include
    Character Collapser Stage
    Character Collapser Stage
    nopaneltrue

Normalizers

Normalizers create alternative normalized interpretations of tokens from original tokens.

  • Case Analysis - 
    Excerpt Include
    Case Analysis Stage
    Case Analysis Stage
    nopaneltrue
  • Synonym
    Excerpt Include
    Synonym Stage
    Synonym Stage
    nopaneltrue
  • Remove Accents 
    Excerpt Include
    Remove Accents Stage
    Remove Accents Stage
    nopaneltrue

Recognizers

Recognizers identify and flag tokens based on their character patterns.

  • Number Recognizer - 
    Excerpt Include
    Number Recognizer Stage
    Number Recognizer Stage
    nopaneltrue
  • Stop Words -
    Excerpt Include
    Stop Words Stage
    Stop Words Stage
    nopaneltrue
  • Lemmatize -
    Excerpt Include
    Lemmatize Stage
    Lemmatize Stage
    nopaneltrue
  • Synonym Stage
    Excerpt Include
    Synonym Stage
    Synonym Stage
    nopaneltrue
  • ABA  Recognizer - 
    Excerpt Include
    ABA Recognizer Stage
    ABA Recognizer Stage
    nopaneltrue
  • BIC Recognizer - 
    Excerpt Include
    BIC Recognizer Stage
    BIC Recognizer Stage
    nopaneltrue
  • IBAN Recognizer -  
    Excerpt Include
    IBAN Recognizer Stage
    IBAN Recognizer Stage
    nopaneltrue
  • Date Time Recognizer 
    Excerpt Include
    Date Time Recognizer Stage
    Date Time Recognizer Stage
    nopaneltrue
  • Email Recognizer 
    Excerpt Include
    Email Recognizer Stage
    Email Recognizer Stage
    nopaneltrue
  • Phone Number - 
    Excerpt Include
    Phone Number Recognizer Stage
    Phone Number Recognizer Stage
    nopaneltrue
  • Postal Code - 
    Excerpt Include
    Postal Code Recognizer Stage
    Postal Code Recognizer Stage
    nopaneltrue
  • URL Recognizer - 
    Excerpt Include
    URL Recognizer Stage
    URL Recognizer Stage
    nopaneltrue
  • Federal Recognizer 
    Excerpt Include
    Federal Recognizer Stage
    Federal Recognizer Stage
    nopaneltrue
  • IP Address Recognizer
    Excerpt Include
    IP Address Recognizer Stage
    IP Address Recognizer Stage
    nopaneltrue
  • Latitude Longitude Recognizer 
    Excerpt Include
    Latitude Longitude Recognizer Stage
    Latitude Longitude Recognizer Stage
    nopaneltrue
  • MAC Address Recognizer 
    Excerpt Include
    MAC Address Recognizer Stage
    MAC Address Recognizer Stage
    nopaneltrue
  • MAID Recognizer 
    Excerpt Include
    MAID Recognizer Stage
    MAID Recognizer Stage
    nopaneltrue
  • Credit Card Recognizer - 
    Excerpt Include
    Credit Card Recognizer Stage
    Credit Card Recognizer Stage
    nopaneltrue

Taggers

Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.

  • Regex Pattern
    Excerpt Include
    Regex Pattern Stage
    Regex Pattern Stage
    nopaneltrue
  • Simple Regex -  
    Excerpt Include
    Simple Reader Stage
    Simple Reader Stage
    nopaneltrue
  • Dictionary Tagger
    Excerpt Include
    Dictionary Tagger Stage
    Dictionary Tagger Stage
    nopaneltrue
  • Advanced Pattern
    Excerpt Include
    Advanced Pattern Stage
    Advanced Pattern Stage
    nopaneltrue
  • Fragmentation
    Excerpt Include
    Fragmentation Stage
    Fragmentation Stage
    nopaneltrue
  • GeoNames -  
    Excerpt Include
    GeoNames Stage
    GeoNames Stage
    nopaneltrue
  • Token Matcher - 
    Excerpt Include
    Token Matcher Recognizer Stage
    Token Matcher Recognizer Stage
    nopaneltrue

Transformers

Transformers generates tags, not of semantic nature, but with new data for later use

  • Bag Of Words
    Excerpt Include
    Bag Of Words Stage
    Bag Of Words Stage
    nopaneltrue
  • Best Bets
    Excerpt Include
    Best Bets Stage
    Best Bets Stage
    nopaneltrue

  • NGram 
    Excerpt Include
    NGram Stage
    NGram Stage
    nopaneltrue

Python

  • Python Model Recognizer 
    Excerpt Include
    Python Model Recognizer Stage
    Python Model Recognizer Stage
    nopaneltrue
  • Python Model
    Excerpt Include
    Python Model Stage
    Python Model Stage
    nopaneltrue

Producers

Producers create consumable output based on the processed graph.

  • Json Producer
    Excerpt Include
    JSON Producer Stage
    JSON Producer Stage
    nopaneltrue
  • Markup Producer
    Excerpt Include
    Markup Producer Stage
    Markup Producer Stage
    nopaneltrue

Filters

Mark vertices or tokens to be skipped by other stages.