Warning

Bag of Words was hidden in version 1.2.2.

It will return in a future version with some improvements.




Creates a bag-of-words / TF-IDF tag that holds the vector information for the document, text block, or sentence. The vector is accumulated until the engine cannot read any further.
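The accumulation described above can be pictured as folding tokens into a running count vector until the input is exhausted. The following is a minimal sketch only, not the stage's actual implementation: the function name accumulate_bag, the lowercasing step, and the sample tokens are assumptions made for illustration.

```python
from collections import Counter
from typing import Iterable


def accumulate_bag(token_stream: Iterable[str]) -> Counter:
    """Fold tokens into a running term-count vector until the stream ends."""
    bag: Counter = Counter()
    for token in token_stream:  # the loop ends when nothing more can be read
        bag[token.lower()] += 1  # lowercasing is an assumption of this sketch
    return bag


# Hypothetical input: the tokens of one sentence.
bag = accumulate_bag(iter(["The", "cat", "saw", "the", "dog"]))
```

The same function could be fed the tokens of a sentence, a text block, or a whole document; only the extent of the stream changes.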

Info

Uses the Bag of Words Stage.

Configuration

  • Min N-Gram size - Minimum number of tokens per word (n-gram)
  • Max N-Gram size - Maximum number of tokens per word (n-gram)
  • Vector Type - Algorithm used to generate the vector
  • Data Sets - Data set from which the vocabulary was generated
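To show how these settings could interact, here is a rough sketch under stated assumptions. It is not the product's code: the names ngrams and vectorize, the "count" and "tfidf" vector type values, the smoothed IDF formula (one common choice), and the sample vocabulary and document frequencies are all hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List, Optional, Sequence


def ngrams(tokens: Sequence[str], min_n: int, max_n: int) -> List[str]:
    """Return every n-gram whose size lies between min_n and max_n inclusive."""
    out = []
    for n in range(min_n, max_n + 1):
        for i in range(len(tokens) - n + 1):
            out.append(" ".join(tokens[i:i + n]))
    return out


def vectorize(
    tokens: Sequence[str],
    vocabulary: Dict[str, int],          # term -> vector index, from the data set
    min_n: int,
    max_n: int,
    vector_type: str = "count",          # hypothetical values: "count" or "tfidf"
    doc_freq: Optional[Dict[str, int]] = None,
    n_docs: int = 0,
) -> List[float]:
    """Build a vector over the vocabulary; terms outside it are ignored."""
    counts = Counter(g for g in ngrams(tokens, min_n, max_n) if g in vocabulary)
    vec = [0.0] * len(vocabulary)
    for term, count in counts.items():
        weight = float(count)
        if vector_type == "tfidf":
            df = (doc_freq or {}).get(term, 0)
            # Smoothed inverse document frequency (an assumption of this sketch).
            weight *= math.log((1 + n_docs) / (1 + df)) + 1.0
        vec[vocabulary[term]] = weight
    return vec


# Hypothetical vocabulary and document frequencies.
vocab = {"new": 0, "york": 1, "new york": 2}
tokens = ["new", "york", "new"]

count_vec = vectorize(tokens, vocab, min_n=1, max_n=2)
bigram_vec = vectorize(tokens, vocab, min_n=2, max_n=2)
tfidf_vec = vectorize(
    tokens, vocab, min_n=1, max_n=2, vector_type="tfidf",
    doc_freq={"new": 3, "york": 1, "new york": 1}, n_docs=3,
)
```

Raising the minimum n-gram size to 2 drops the single-token entries from the vector, and switching the vector type to TF-IDF scales each count by how rare the term is in the data set.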

General Settings

See Generic Processor Config.