Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Lemmatize tokens are matched to words in a dictionary.

Info

Uses Lemmatize Stage

Configuration


Method

  • Build-in dictionary - Out of the box dictionaries

    • Language - Currently we support English and Spanish

  • Custom dictionary - CostumeCustom dictionary, to know the format of the dictionary please check Lemmatize Stage

    • Dictionary Resource - The name of the resource where the dictionary can be found 

Tip

You can use the default Saga Server file system provider filesystem-provider pointing to the config folder, and add your dictionary there (e.g filesystem-provider:dictionary-name)


Image Added

Build-in DiccionatyDictionary

Image Removed Image Removed

General Settings

Include PageGeneric Processor ConfigGeneric Processor Config

Lemmatize tokens are matched to words in a dictionary.

Info

Uses Lemmatize Stage

Configuration

Method

  • Build-in dictionary - Out of the box dictionaries

    • Language - Currently we support English and Spanish

  • Custom dictionary - 

  • Dictionary Resource - The name of the resource where the dictionary can be found 

  • Tip

    You can use the default Saga Server file provider

    filesystem-provider
    Build-in Diccionaty
    • dictionary (string, optional) - The resource containing the list of words and relationships.

      • If no dictionary is provided, a default dictionary will be used.
    • include (list, optional) - A list of the relationships to include.exclude (list, optional) - A list of the relationships to exclude.

      languageISO3 (string, optional) - The language the lemmatize stage should use. The value needs to be one of the ISO 3 letter language codes.By default, English is always used unless configured otherwise

      .

      At the moment, only English (ENG) and Spanish (SPA) are available.
    Image Removed


    General Settings

    Include Page
    Generic Processor Config
    Generic Processor Config