Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

This processor is used to split text blocks by punctuation.  The processor

uses

uses Apaches OpenNLP Sentence Detector

to

 to identify punctuation character marks to define the end of a sentence.

Note

This is a plugin processor. Uses Sentence Breaker Stage.


English modelThe processor includes 4 pre-trained models for specific languages:

  • English model
  • Dutch model
  • German model
  • Portuguese model


Configuration

  • Language - Language ISO code, on the UI is represented by the language name and can be selected from a drop down list.

General Settings

Include Page
Generic Processor Config
Generic Processor Config