You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

This is a guide on how to create a new tag, assign a recognizer to it, add new patterns and text processing example.

This tutorial assumes:

  • The reader ability to create a project with Maven Framework support
  • The data Saga will use is manage through the Saga's user interface, only the creation of new items will be addressed.
  • Java 11+ is installed in the machine

Configure pom.xml

To use the following code you'll need the next dependencies:

Sample pom.xml section
<dependency>
   <groupId>com.fasterxml.jackson.core</groupId>
   <artifactId>jackson-databind</artifactId>
   <version>{jackson-version}</version>
</dependency>

Feel free to use your favorite JSON processing API.


This guide will include simple usage of REST services and the general documentation of this services can be found here.

Getting Basic Information

Before coding a new example there are some things we are assuming.

  1. There is a default pipeline, known as baseline-pipeline, with the following structure:

    baseline-pipeline
    {
        "stages": [
            {
                "language": "en",
                "type": "TextBreakerStage"
            },
            {
                "requiredFlags": [
                    "SENTENCE"
                ],
                "type": "WhitespaceTokenizerStage"
            },
            {
                "type": "StopWordsStage"
            },
            {
                "type": "CaseAnalysis"
            },
            {
                "type": "CharChangeSplitter"
            }
        ]
    }

Pipelines information

You can get information regarding the pipelines using the REST API.  All we need for now is the name of the pipeline we need.  To get a list of pipeline names we can use something like this:


Datasets information


Adding New Items

Add A New Tag

Assign A Recognizer To Tag

Add Patterns To Recognizer

Processing Text

Choosing An Output Format

Related articles

  • No labels