Versions Compared

...

To train a model, we need another recognizer as the base and a dataset that contains people's names. We'll use the Entity recognizer that we previously set to disabled and the AndroidReddit dataset.

a. Enable the Entity recognizer for the {human} tag

b. In the Name recognizer:

    - set the Name recognizer to disabled

    - set the 'Model' field to --NONE--

    - add the value 'human' to the 'Normalized Tags' field

    - click on the 'Train' button.

c. Once the Train dialog opens, select the AndroidReddit dataset and set the machine learning options according to your needs. In this case we are leaving everything with default values.

To train a model, we need another recognizer as the base and a dataset that has a good sample of the values we want to identify. In this case we'll use the CFR-2018 dataset, which contains government regulations.

a.  Create a new tag called {emissions-equipment}, attach the Entity recognizer and add the following patterns:

    - vehicle

    - locomotive

    - truck

    - marine engine

    - tanker truck

    - engine
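To illustrate how a pattern list like the one above can tag text, here is a minimal Python sketch. It is not the Entity recognizer's actual implementation: the function names, the whole-word and case-insensitive matching, and the sample sentence are all assumptions made for illustration. Only the six patterns and the tag name come from the steps above.

```python
import re

# Patterns taken from the list above; everything else is hypothetical.
PATTERNS = ["vehicle", "locomotive", "truck", "marine engine", "tanker truck", "engine"]


def build_matcher(patterns):
    # Longest first, so that when two patterns start at the same position
    # the longer one is tried (and preferred) before the shorter one.
    ordered = sorted(patterns, key=len, reverse=True)
    alternation = "|".join(re.escape(p) for p in ordered)
    # \b restricts matches to whole words, so plurals such as "trucks"
    # are NOT matched by this simple sketch.
    return re.compile(r"\b(?:" + alternation + r")\b", re.IGNORECASE)


def tag_text(text, tag="emissions-equipment", matcher=None):
    """Return (start, end, surface text, tag) for every pattern match."""
    matcher = matcher or build_matcher(PATTERNS)
    return [(m.start(), m.end(), m.group(0), tag) for m in matcher.finditer(text)]


# Hypothetical sentence, not taken from the CFR dataset.
sample = "Each tanker truck and marine engine must be inspected; the locomotive is exempt."
for start, end, surface, tag in tag_text(sample):
    print(f"{start:3d}-{end:3d}  {{{tag}}}  {surface}")
```

Matches like these are what give the training step its labeled examples: the pattern-based recognizer marks known values in the dataset, and the trained model can then learn to recognize similar values from context.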

b. Attach the Name recognizer to your tag and then click the 'Train' button. Select the 'CFR-2018-title40' dataset and then click the execute button.


d. Check the Background Processes tab to see the progress of the training.

e. Once the process completes:

    - go to your tag and disable the Entity recognizer for the {human} tag again

    - enable the Name recognizer

    - in the Name recognizer, select your recently created model (the result of the training). It should be named something like "emissions-equipment-[date stamp here]", for example: emissions-equipment-20190206172621 (for the {human} tag: human-20190206172621).

      You can also use the option --LATEST-- so that it always uses the most recent model you have created.

    - use the preview functionality to test your newly created model. Enter the same text as before in the preview text box: "Several employees work from home, Joseph is one of them, Paul too"

    - Set the 'Minimum Probability' field to 0.5

    - Run a TestRun against the CFR dataset.

    - Once the test run finishes, use the search interface to check whether any text was tagged with the {emissions-equipment} tag. If you see tagged text, the model you trained is working.


    - check if names are tagged as well
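The model naming and the --LATEST-- option from step e can be sketched in Python as follows. This is only an illustration under stated assumptions: the date stamp is assumed to be year, month, day, hour, minute, second (which the example 20190206172621 suggests but the steps do not state), and how the application actually resolves --LATEST-- is not documented here. The helper names and the second, older model name are hypothetical.

```python
from datetime import datetime

# Assumed stamp layout, inferred from the example 20190206172621.
STAMP_FORMAT = "%Y%m%d%H%M%S"


def model_name(tag, when):
    """Build a name of the form '<tag>-<date stamp>'."""
    return f"{tag}-{when.strftime(STAMP_FORMAT)}"


def parse_stamp(name):
    # The tag itself may contain hyphens, so split on the last one only.
    _, stamp = name.rsplit("-", 1)
    return datetime.strptime(stamp, STAMP_FORMAT)


def latest_model(names):
    """Pick the name with the most recent date stamp."""
    return max(names, key=parse_stamp)


models = [
    "emissions-equipment-20190206172621",
    "emissions-equipment-20190101093000",  # hypothetical older model
]
print(latest_model(models))
```

Splitting on the last hyphen matters for tags such as emissions-equipment, where a plain split on "-" would cut the tag name apart.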