Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Connects directly to the Python Bridge, to send text or sections of the interpretation graph to be process processed by ML algorithms in Python.

This recognizer is used when there is a need to classify an entire document for example. That is the difference when compared to the Python Model Recognizer which will be processing or running for each token or text block.

Processing an entire document has its benefits, it may be the best way to classify a document as a whole.  By running only once per document we get a boost in performance when compared to run the recognizer for each individual token or text block.

The other benefit is that we could normalize the text before sending it to the python model and also specify dependent tags so it runs in the order we need in the processing pipeline.


Info

Uses Python Classification Watcher Stage

Image RemovedImage Added

  • Parameter
    summaryFlag to trigger this recognizer.
    defaultEOF
    nameTrigger Flag
  • Parameter
    summaryDesired protocol to connect to Python Bridge (HTTP / HTTPS)
    defaultHTTP
    nameService Protocol
  • Parameter
    summaryURL for Name of the host where Python Bridge is running.
    defaultlocalhost
    nameService Hostname
    requiredtrue
  • Parameter
    summaryPort where Python Bridge is running.
    default5000
    nameService Port
    requiredtrue
  • Parameter
    summaryWhether or not to use authentication for the python bridge
    defaultfalse
    nameAuthentication
    typeboolean
    requiredtrue
  • Parameter
    summaryModel name registered in the python bridge.
    nameSelect Model
    requiredtrue
  • Parameter
    summaryModel version registered in the python wrapper to query.
    nameSelect Version
  • Parameter
    summaryModel method to call for the model.
    nameSelect Model Method
    requiredtrue

Configuration

Image Added

  • Parameter
    summaryMaximum number of connection retries Whether to send the Python Bridge.original text to Python or not
    defaultfalse
    nameSend Original Text
    typeboolean
  • Parameter
    summarySplits the text in blocks according to the split pattern and sends the text block list
    default3false
    nameNumber of retries
    requiredtrue

Configuration

Image Removed
  • Send As Text Blocks
    typeboolean
  • Parameter
    summarySends the content as a collection of tokens. This setting interacts with the previous ones, the original text can be sent as a token list and both can be grouped in lists of tokens corresponding to text blocks
    defaultfalse
    nameSend as Token Collection
    typeboolean
  • Parameter
    summaryIncludes text from vertex as new tokens.
    defaultfalse
    nameInclude Vertex Text as Token
    typeboolean
  • Parameter
    summaryTags (flagged as SEMANTIC_TAG) are normalized.
    defaultfalse
    nameNormalize Tags
    typeboolean
  • Parameter
    summaryList of tags to be dependent for the model.
    nameDependency tags
    typestring array

General Settings

Include Page
Generic Recognizer Config
Generic Recognizer Config