Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The new patterns can be formed by previously defined tags and text literals. For example, the pattern "{name} likes {product}" where "{name}" and "{product}" are tags and "likes" is the text literal. Since this stage allows for complex entity tagging, is also know as the "Advance Recognizer".

...

  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
  • PROCESSED - Placed on all the tokens which composed that compose the semantic tag.

Vertex Flags:

...

Info

No vertices are created in this stage.

Resource Data

The resource data will be a database of advanced patterns, and the resulting semantic tag which that they produce.

Resource Format

The pattern database is a series of JSON records, typically indexed by "pattern block ID". Each JSON record represents an a block of patterns (one or more) which that all produce the same semantic tag. The format is as follows:

...

  • Parameter
    summaryIdentifies the entry. This identifier may not be unique, and is not the id of the database, but an specific use case id
    nameid
    requiredtrue

    • Typically, this is an identifier with meaning to the larger application which that is using the Language Processing Toolkit.
  • Parameter
    summaryWhat to show the user when browsing this entity
    namedisplay
    requiredtrue
  • Parameter
    summaryTag which will identify any match in the graph, as an interpretation
    nametag
    requiredtrue
    • These will all be added to the interpretation graph with the SEMANTIC_TAG flag.
    • Tip

      Tags are hierarchical representations of the same intent. For example, {city} → {administrative-area} → {geographical-area}

  • Parameter
    summaryPattern to match in the content
    namepattern
    requiredtrue

    • Note

      Currently, tokens are separated on simple white-space and punctuation, and then reduced to lowercase.

  • Parameter
    summarySpecifies the confidence level of the entry, independent of any patterns matched
    nameconfidence
    typeboolean
    requiredtrue
    • This is the confidence of the entry, in comparison to all of the other entries. (Essentially, the likelihood that this entity will be randomly encountered.)
  • Parameter
    summaryAdditional information for the entry. At the moment this information is not expected
    defaultempty json
    namefields
    typejson
  • Parameter
    summaryDate in milliseconds of the last time the entry was updated
    nameupdatedAt
    typedate epoch
    requiredtrue
  • Parameter
    summaryDate in milliseconds of the creation time of the entry
    namecreatedAt
    typedate epoch
    requiredtrue