Part-of-speech (POS) tagging marks each word in a text (corpus) as a particular part of speech, such as noun, verb, or adjective, based on both its definition and its context. This stage uses OpenNLP (https://opennlp.apache.org/) and its POS Tagger. Each token is tagged with flags, so no semantic tag is created in this stage.
Operates On: Lexical Items with TOKEN and possibly other flags as specified below.
Library: saga-parts-of-speech-stage
Currently, only English is supported.
POS_??? - Flags all TOKENs where a part of speech was recognized.
Note the '???' at the end of the flag. It is replaced by the acronym of the part of speech identified.
For example, if a base form verb is detected, the acronym is VB, and the flag is "POS_VB".
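The naming rule above can be sketched in a few lines. This is an illustration only, not the stage's implementation: the helper `pos_flag` and the token/acronym pairs are hypothetical, and the sketch assumes the acronyms have already been produced by the tagger and that each tagged item keeps its TOKEN flag.

```python
def pos_flag(tag: str) -> str:
    """Build the flag name for a part-of-speech acronym (hypothetical helper)."""
    return "POS_" + tag


# Hypothetical tagger output: (token, part-of-speech acronym) pairs.
tagged = [("The", "DT"), ("dogs", "NNS"), ("bark", "VBP")]

# Assumption: each token keeps its TOKEN flag and gains one POS_??? flag.
flags = {token: {"TOKEN", pos_flag(tag)} for token, tag in tagged}
```

Here `pos_flag("VB")` yields `"POS_VB"`, matching the example in the text.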
No vertices are created in this stage.
Flag | Definition |
---|---|
POS_CC | Coordinating conjunction |
POS_CD | Cardinal number |
POS_DT | Determiner |
POS_EX | Existential there |
POS_FW | Foreign word |
POS_IN | Preposition or subordinating conjunction |
POS_JJ | Adjective |
POS_JJR | Adjective, comparative |
POS_JJS | Adjective, superlative |
POS_LS | List item marker |
POS_MD | Modal |
POS_NN | Noun, singular or mass |
POS_NNS | Noun, plural |
POS_NNP | Proper noun, singular |
POS_NNPS | Proper noun, plural |
POS_PDT | Predeterminer |
POS_POS | Possessive ending |
POS_PRP | Personal pronoun |
POS_PRP$ | Possessive pronoun |
POS_RB | Adverb |
POS_RBR | Adverb, comparative |
POS_RBS | Adverb, superlative |
POS_RP | Particle |
POS_SYM | Symbol |
POS_TO | to |
POS_UH | Interjection |
POS_VB | Verb, base form |
POS_VBD | Verb, past tense |
POS_VBG | Verb, gerund or present participle |
POS_VBN | Verb, past participle |
POS_VBP | Verb, non-3rd person singular present |
POS_VBZ | Verb, 3rd person singular present |
POS_WDT | Wh-determiner |
POS_WP | Wh-pronoun |
POS_WP$ | Possessive wh-pronoun |
POS_WRB | Wh-adverb |
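
Related flags in the table above share a prefix (for example, every noun flag starts with POS_NN), so a downstream consumer could group them into coarse categories by prefix matching. The following is a minimal sketch under that observation; the helper `coarse_category` and the category names are hypothetical and not part of the stage.

```python
from typing import Optional

# Hypothetical prefix-to-category mapping derived from the flag table.
COARSE_PREFIXES = [
    ("POS_NN", "noun"),       # POS_NN, POS_NNS, POS_NNP, POS_NNPS
    ("POS_VB", "verb"),       # POS_VB, POS_VBD, POS_VBG, POS_VBN, POS_VBP, POS_VBZ
    ("POS_JJ", "adjective"),  # POS_JJ, POS_JJR, POS_JJS
    ("POS_RB", "adverb"),     # POS_RB, POS_RBR, POS_RBS
]


def coarse_category(flag: str) -> Optional[str]:
    """Return a coarse category for a POS flag, or None if it has no group."""
    for prefix, category in COARSE_PREFIXES:
        if flag.startswith(prefix):
            return category
    return None
```

Flags outside these four families, such as POS_RP or POS_WRB, return `None` in this sketch.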