Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • patterns (string, required) - The resource which contains the pattern database
    • See below for the format.
  • caseInsensitive (boolean, optional) - If true, all regex will be process as case insensitive (default = true)


Code Block
languagejs
themeEclipse
titleExample Configuration
{
 "type":"DictionaryTagger",
 "patterns":"regex-provider:patterns",
 "caseInsensitive": true
}

...

Each JSON record represents an entity. The format is as follows:

Code Block
languagejs
themeEclipse
titleEntity JSON Format
{
    "_id" : "ca84",
    "tags" : [ 
        "number"
    ],
    "patterns" : [ 
        "[0-9]+", 
        "[0-9]+\\.[0-9]+"
    ],
    "confidence" : 0.95
  . . . additional fields as needed go here . . . 
}

...