Excerpt |
---|
...
This Stage flags tokens that are matched to |
...
Stop-Words. The flagged tokens will be skipped in subsequent stages (if so indicated on the configuration). |
Operates On: Lexical Items with TOKEN and possibly other flags as specified below.
Saga_is_recognizer Recognizer false
Include Page | ||
---|---|---|
|
...
|
Parameter summary If true, all stop words and tokens will be
...
processed as case insensitive
...
. default
...
true name caseInsensitive type boolean
Parameter summary
The resource containing the list of stop words. Or the direct list of stop words name stopWords
...
...
...
...
Info | ||
---|---|---|
| ||
a, an, and, are, as, at, be, but, by, for, if, in, into, is, it, no, not, of, on, or, such, that, the, their, then, there, these, they, this, to, was, will, with |
Code Block | ||||
---|---|---|---|---|
|
...
| ||
"caseInsensitive" : true,
"stopWords" : "words-provider:stop_words" |
Code Block | ||||||
---|---|---|---|---|---|---|
| ||||||
"caseInsensitive" : true,
"stopWords" : ["a", "about", "above", "after", "again", "all",
"am", "an", "and", "the", "i", "who", ...] |
Code Block | ||
---|---|---|
|
...
V--------------[A test to be skipped]--------------V
|
...
^--[A]--V--[test]--V--[to]--V--[be]--V--[skipped]--^
|
...
^--[a]--^ Item [A] - [TOKEN, STOP_WORD |
...
] Item [to] - [TOKEN, STOP_WORD |
...
] Item [be] - [TOKEN, STOP_WORD |
...
]
Item [a] - [TOKEN, |
...
STOP_WORD ] |
...
...
...
Info |
---|
No vertices are created in this stage |
The resource data will be a json file with an array of words in a field
...
named stopWords.
Code Block | ||
---|---|---|
|
...
...
"stopWords": ["a", "about", "above", "after", "again", "all", "am", "an", "and", "the", "i", "who", ...] |
...