
This recognizer tags entities using an NER model trained with the Google AutoML Entity Extraction Cloud API. The recognizer connects to the cloud API to use a model that was already trained there (Saga does not yet perform the training itself).

This is a plugin recognizer.

Saga uses a keys.json file from the service account configured to interact with this API to authenticate all GCP REST calls. More information here.

Configuration

  • API URL ( type=string | required ) - URL of the API endpoint used for evaluation. This value can be obtained from the GCP console for the AutoML service.
  • Minimum Probability ( type=double | default=0.7 | required ) - Minimum probability a match must reach to be considered for tagging.
  • To Upper Case ( type=boolean | default=false | optional ) - If your model was trained on upper-case text, enable this option to convert the text from Saga to upper case before evaluation.
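
The three options above can be illustrated with a minimal Python sketch. This is not Saga's implementation: the `RecognizerConfig` class, the helper names, the placeholder URL, and the shape of the prediction dictionaries (a `text` and a `score` key) are all hypothetical. The inclusive comparison against the threshold is also an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RecognizerConfig:
    """Hypothetical container mirroring the three options listed above."""
    api_url: str                       # API URL (required)
    minimum_probability: float = 0.7   # Minimum Probability
    to_upper_case: bool = False        # To Upper Case


def prepare_text(text: str, config: RecognizerConfig) -> str:
    """Upper-case the text before evaluation when the option is enabled."""
    return text.upper() if config.to_upper_case else text


def filter_predictions(predictions: List[Dict], config: RecognizerConfig) -> List[Dict]:
    """Keep only predictions whose score reaches the configured threshold.

    Assumption: the comparison is inclusive (score >= threshold).
    """
    return [p for p in predictions if p["score"] >= config.minimum_probability]


# Placeholder values for illustration only.
config = RecognizerConfig(api_url="https://example.invalid/predict", to_upper_case=True)
prepared = prepare_text("Acme Corp hired Jane Doe", config)
kept = filter_predictions(
    [{"text": "ACME CORP", "score": 0.91}, {"text": "HIRED", "score": 0.35}],
    config,
)
```

With the default threshold of 0.7, only the first hypothetical prediction would be tagged.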

General Settings

The general settings can be accessed by clicking on 

More settings may be displayed in the same dialog; this varies per recognizer.


  • Enable - Enable the processor to be used in pipelines.
  • Base Pipeline - Indicates the last stage of a pipeline that the recognizer requires.
  • Skip Flags ( optional ) - Lexical item flags to be ignored by this processor.
  • Boundary Flags ( optional ) - List of vertex flags that indicate the beginning and end of a text block.
  • Required Flags ( optional ) - Lexical item flags that every token must have in order to be processed.
  • At Least One Flag ( optional ) - Lexical item flags of which every token must have at least one in order to be processed.
  • Don't Process Flags ( optional ) - List of lexical item flags that are not processed. The difference from "Skip Flags" is that this option drops the path in the Saga graph, whereas "Skip Flags" only skips the token and continues along the same path.
  • Confidence Adjustment - Adjustment factor, from 0.0 to 2.0, applied to the confidence value of every match.
    • 0.0 to < 1.0 decreases the confidence value
    • 1.0 leaves the confidence value unchanged
    • > 1.0 to 2.0 increases the confidence value
  • Debug - Enable debug logging.
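
As a rough illustration of the Confidence Adjustment setting, the sketch below assumes the factor is applied by multiplication, which matches the described behavior (below 1.0 decreases, 1.0 leaves unchanged, above 1.0 increases). The function name and the clamping of the result to 1.0 are assumptions, not confirmed Saga behavior.

```python
def adjust_confidence(confidence: float, adjustment: float = 1.0) -> float:
    """Scale a match confidence by an adjustment factor in [0.0, 2.0].

    Assumptions: the factor is multiplicative, and the result is capped
    at 1.0 so that it remains a valid confidence value.
    """
    if not 0.0 <= adjustment <= 2.0:
        raise ValueError("adjustment must be between 0.0 and 2.0")
    return min(1.0, confidence * adjustment)


lowered = adjust_confidence(0.5, 0.5)    # factor below 1.0 decreases the value
unchanged = adjust_confidence(0.5, 1.0)  # factor of 1.0 leaves it as is
raised = adjust_confidence(0.5, 1.5)     # factor above 1.0 increases the value
```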
