Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Readers read text streams and create text blocks to process.

...

Breakers read text blocks and break them into individual text blocks.

...

Tokenizers read text blocks and divide them up into individual tokens to be processed.

...

Splitters split up tokens into multiple smaller tokens as an alternative interpretation.

Collapsers

Collapsers reduce tokens into simpler smaller tokens as an alternative interpretation.

  • Character Collapser - 

    Excerpt Include
    Character Collapser Stage
    Character Collapser Stage
    nopaneltrue

Normalizers

Normalizers create alternative normalized interpretations of tokens from original tokens.

...

Recognizers identify and flag tokens based on their character patterns.

  • NumberRecognizerNumber Recognizer - 
    Excerpt Include
    Number Recognizer Stage
    Number Recognizer Stage
    nopaneltrue
  • StopWords Stop Words -
    Excerpt Include
    Stop Words Stage
    Stop Words Stage
    nopaneltrue
  • Lemmatize -
    Excerpt Include
    Lemmatize Stage
    Lemmatize Stage
    nopaneltrue
  • ABA
    Excerpt Include
    ABA Stage
    ABA Stage
    nopaneltrue
  • BIC
    Excerpt Include
    BIC Stage
    BIC Stage
    nopaneltrue

...

Taggers create semantic tags which are added to the interpretation graph as alternative interpretations.

...

Transformers generates tags, not of semantic nature, but with new data for later use

  • Bag Of Words
    Excerpt Include
    Bag Of Words Stage
    Bag Of Words Stage
    nopaneltrue
  • Best Bets
    Excerpt Include
    Best Bets Stage
    Best Bets Stage
    nopaneltrue

...

Producers create consumable output based on the processed graph.

...