...
Include Page |
---|
| Generic Configuration Parameters |
---|
| Generic Configuration Parameters |
---|
|
Configuration Parameters
- patterns (string, required) -
The resource containing the pattern database |
|
.- See below for the format.
- For the format see below.
Saga_config_stage |
---|
" |
Code Block |
---|
language | js |
---|
theme | Eclipse |
---|
title | Example Configuration |
---|
|
{
"type":"SimpleRegex",
"patterns":"regex-provider:patterns"
} |
Example Output
In the following example, "number" is in the dictionary as a regex for using "[0-9]+" and "[0-9]+\\.[0-9]+" :
...
Each JSON record represents an entity. The format is as follows:
Saga_json |
---|
Code Block |
---|
language | js |
---|
theme | Eclipse |
---|
title | Entity JSON Format |
---|
|
{
"_id" : "ca84KGAAJGsBemSwA0nZTLXA",
"tagstag" : [
"number"
],
"patternspattern" : [
"[0-9]+",
"[0-9]+\\.[0-9]+",
"options" : ],{
"confidencecaseInsensitive" : 0.95
true,
"literal" : false
},
"caseInsensitiveconfAdjust": true
}0.95
. . . additional fields as needed go here . . . |
Notes
- Multiple patterns can have the same entry.
- Additional fielded data can be added to the record.
- As needed by downstream processes.
...
Parameter |
---|
summary | Identifies the entity by unique ID. This identifier must be unique across all entries (across all dictionaries). |
---|
name | _id |
---|
required | true |
---|
|
Typically, this is an identifier with meaning to the larger application that is using the Language Processing Toolkit. Parameter |
---|
summary | The list of semantic tags that will be added to the interpretation graph whenever any of the patterns are matched.Tag which will identify any match in the graph, as an interpretation |
---|
name | tagstag |
---|
type | string array |
---|
required | true |
---|
|
- A list of patterns
Pattern to match in the content |
|
.patternstype | string array |
---|
- Options
Parameter |
---|
summary | When this flag is specified then the input string that specifies the pattern is treated as a sequence of literal characters. Metacharacters or escape sequences in the input sequence will be given no special meaning. |
---|
default | false |
---|
name | literal |
---|
|
Parameter |
---|
summary | Set to true if the pattern is not case sensitive. |
---|
default | true |
---|
name | caseInsensitive |
---|
type | boolean |
---|
|
parameter
...
summary | Specifies the confidence level of the entity, independent of any patterns matched. |
---|
name | confidence |
---|
type | double |
---|
- This is the confidence of the entry, in comparison to all of the other entries. Essentially, the likelihood that this entry will be encountered randomly.
| Generic Resource Fields |
---|
| Generic Resource Fields |
---|
|
Other Optional Fields
...
Parameter |
---|
summary | What to show the user when browsing the entity. |
---|
name | display |
---|
|
Parameter |
---|
summary | A context vector that helps disambiguate the entity from others with the same pattern. |
---|
name | context |
---|
|
...