You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Flag tokens matched to stop words, to be skipped for following stages

Operates On:  Lexical Items with TOKEN

Configuration Parameters

  • caseInsensitive (string, optional) - If true, all stop words and tokens will be process as case insensitive (default = true)

  • stopWords (string, optional) - The resource containing the list of stop words

    • See below for the format.


Example Configuration
{
  "type": "StopWordsStage",
  "stopWords" : "words-provider:stop_words"
}

describe the configuration...

Example Output

describe the example output...

V--------------[abraham lincoln likes macaroni and cheese]--------------------V
^--[abraham]--V--[lincoln]--V--[likes]--V--[macaroni]--V--[and]--V--[cheese]--^
              ^---{place}---^           ^----{food}----^         ^---{food}---^
^----------{person}---------^           ^-----------------{food}--------------^

Output Flags

Lex-Item Flags:

  • SKIP - All matched stop words will be marked as SKIP

Resource Data

The resource data will be a database of advanced patterns, and the resulting semantic tag which they produce.

  • No labels