Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This processor is used to split text blocks by punctuation.  The processor uses Apaches OpenNLP Sentence Detector to identify punctuation character marks to define the end of a sentence.

Infonote

This is a plugin processor. Uses Sentence Breaker Stage.

English modelThe processor includes 4 pre-trained models for specific languages:

English model

  • Dutch model
  • German model
  • Portuguese model


Configuration

  • Language - Language ISO code, on the UI is represented by the language name and can be selected from a drop down list.

General Settings

Include Page
Generic Processor Config
Generic Processor Config