Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Lemmatize tokens are matched to words in a dictionary.

Info

Uses Lemmatize Stage

Configuration

Method

  • Build-in dictionary - Out of the box dictionaries

    • Language - Currently we support English and Spanish

  • Custom dictionary - Costume dictionary, to know the format of the dictionary please check Lemmatize Stage

    • Dictionary Resource - The name of the resource where the dictionary can be found 

Tip

You can use the default Saga Server file system provider filesystem-provider pointing to the config folder, and add your dictionary there (e.g filesystem-provider:dictionary-name)



Build-in Diccionaty

Image Removed Image Removed

General Settings

Include PageGeneric Processor ConfigGeneric Processor Config

Lemmatize tokens are matched to words in a dictionary.

Info

Uses Lemmatize Stage

Configuration

Method

  • Build-in dictionary - Out of the box dictionaries

    • Language - Currently we support English and Spanish

  • Custom dictionary - 

  • Dictionary Resource - The name of the resource where the dictionary can be found 

  • Tip

    You can use the default Saga Server file provider

    filesystem-provider
    Build-in Diccionaty
    • dictionary (string, optional) - The resource containing the list of words and relationships.

      • If no dictionary is provided, a default dictionary will be used.
    • include (list, optional) - A list of the relationships to include.

    • exclude (list, optional) - A list of the relationships to exclude.

    • languageISO3 (string, optional) - The language the lemmatize stage should use. The value needs to be one of the ISO 3 letter language codes.
      • By default, English is always used unless configured otherwise. At the moment, only English (ENG) and Spanish (SPA) are available.

    Image RemovedImage Added Image Added

    General Settings

    Include Page
    Generic Processor Config
    Generic Processor Config