The Markup Producer stage produces strings of TEXT_BLOCK or SENTENCE text with normalized tags, based on the graph path with the highest confidence.

...

Include Page
Generic Producer Configuration Parameters

Configuration Parameters

  • normalizeTags (String array, required) - List of tag names to normalize in the output.
  • replaceTags (String array, optional) - Defaults to empty. If non-empty, the list of tag names to which replacePattern is applied.
  • replacePattern (String, optional) - Required when replaceTags is set. The pattern may contain the placeholders %tag and/or %value. Each occurrence of a tag listed in replaceTags is replaced with this pattern.
  • separator (String, optional) - Defaults to a single whitespace. Used to separate tokens.
  • preferFlags (String array, optional) - Defaults to empty. If non-empty and a token has multiple flags, a flag listed in preferFlags takes precedence over the others, and the token value from that variation is used, e.g. the LOWERED version over the original text.
  • ignoreTags (String array, optional) - Ignore matches with tags specified in the ignoreTags list.
  • anyWithTags (String array, optional) - Include matches with tags specified in the anyWithTags list.
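The preferFlags rule can be pictured with a small sketch. This is only an illustration, not the stage's implementation: the token shape (a list of variations, each with `text` and `flags`), the `ORIGINAL` flag name, and the `pickVariation` helper are all hypothetical. Only the LOWERED flag comes from the description above.

```javascript
// Hypothetical token model: each variation carries its text and a list of flags.
// This is NOT the stage's internal data structure, only an illustration.
function pickVariation(variations, preferFlags) {
  // Prefer the first variation that carries any flag listed in preferFlags.
  const preferred = variations.find((v) =>
    v.flags.some((flag) => preferFlags.includes(flag))
  );
  // Otherwise fall back to the first variation (assumed here to be the original text).
  return preferred || variations[0];
}

const variations = [
  { text: "Olive Oil", flags: ["ORIGINAL"] }, // "ORIGINAL" is a made-up flag name
  { text: "olive oil", flags: ["LOWERED"] },
];

console.log(pickVariation(variations, ["LOWERED"]).text); // olive oil
console.log(pickVariation(variations, []).text);          // Olive Oil
```

With an empty preferFlags list the sketch simply keeps the first variation, which mirrors the documented default of no preference.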


Example Configuration

{
  "type": "MarkupProducerStage",
  "name": "MarkupProducer",
  "boundaryFlags": [
    "TEXT_BLOCK_SPLIT", "SENTENCE_SPLIT"
  ],
  "normalizeTags": ["#", "measurement"],
  "replaceTags": ["ingredient"],
  "replacePattern": "<START:%tag> %value <END>",
  "separator": " "
}
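
To make the effect of replaceTags, replacePattern and separator in this configuration concrete, here is a minimal sketch of the substitution. It assumes that %tag and %value are replaced literally by the tag name and the token value. The token list and the `render` helper are hypothetical, and the stage's real output may differ in detail.

```javascript
// Values copied from the example configuration above.
const config = {
  replaceTags: ["ingredient"],
  replacePattern: "<START:%tag> %value <END>",
  separator: " ",
};

// Hypothetical input: tokens along the selected path, some carrying a tag.
const tokens = [
  { value: "add" },
  { value: "olive oil", tag: "ingredient" },
  { value: "slowly" },
];

// Illustrative helper, not part of the stage's API.
function render(tokens, config) {
  return tokens
    .map((t) =>
      config.replaceTags.includes(t.tag)
        ? config.replacePattern
            // Function replacers avoid special handling of "$" in the inserted text.
            .replace(/%tag/g, () => t.tag)
            .replace(/%value/g, () => t.value)
        : t.value
    )
    .join(config.separator);
}

console.log(render(tokens, config));
// add <START:ingredient> olive oil <END> slowly
```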

...