This processor splits text blocks into sentences. It uses the Apache OpenNLP Sentence Detector to decide which punctuation marks end a sentence.

Note

This is a plugin processor. It uses the Sentence Breaker Stage.


The processor includes four pre-trained models, one for each supported language:

  • English model
  • Dutch model
  • German model
  • Portuguese model
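To illustrate what splitting a text block into sentences produces, here is a minimal sketch. It does not use OpenNLP or the processor's own code: it swaps in the JDK's rule-based `java.text.BreakIterator` as a stand-in, so its results can differ from those of the pre-trained models. The class and method names (`SentenceSplitSketch`, `split`) are hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch: JDK BreakIterator stands in for the OpenNLP Sentence Detector.
class SentenceSplitSketch {

    // Splits a text block into trimmed sentences using the rules for the given locale.
    static List<String> split(String text, Locale locale) {
        BreakIterator boundaries = BreakIterator.getSentenceInstance(locale);
        boundaries.setText(text);

        List<String> sentences = new ArrayList<>();
        int start = boundaries.first();
        for (int end = boundaries.next();
             end != BreakIterator.DONE;
             start = end, end = boundaries.next()) {
            // Each segment includes trailing whitespace, so trim it.
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        String block = "The processor reads a text block. It finds sentence ends! Does it work?";
        for (String sentence : split(block, Locale.ENGLISH)) {
            System.out.println(sentence);
        }
    }
}
```

Each returned element corresponds to one sentence of the input block. A trained model such as the ones listed above is generally better than fixed rules at cases like abbreviations, which is one reason to select the model that matches the language of the text.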


Configuration

  • Language - ISO code of the language to use. In the UI, it is shown as the language name and is selected from a drop-down list.
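The relationship between the stored ISO code and the name shown in the drop-down list can be sketched as follows. This is an illustration only: it assumes two-letter ISO 639-1 codes (`en`, `nl`, `de`, `pt`) for the four bundled models, which the page does not state, and the class name `LanguageDropdownSketch` is hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: maps assumed ISO 639-1 codes to the names a UI might display.
class LanguageDropdownSketch {

    // Assumption: the processor identifies its four models by these two-letter codes.
    static final String[] ISO_CODES = {"en", "nl", "de", "pt"};

    // Returns code -> display name, in the order the options would be listed.
    static Map<String, String> options() {
        Map<String, String> options = new LinkedHashMap<>();
        for (String code : ISO_CODES) {
            String name = Locale.forLanguageTag(code).getDisplayLanguage(Locale.ENGLISH);
            options.put(code, name);
        }
        return options;
    }

    public static void main(String[] args) {
        // Prints one "code = name" pair per line, e.g. the entry for "nl" is "Dutch".
        options().forEach((code, name) -> System.out.println(code + " = " + name));
    }
}
```

In this sketch the configuration would keep the code (the map key), while the user only ever sees the name (the map value).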


General Settings

See the Generic Processor Config page.