Excerpt |
---|
Identifies tokens that look like dates or time indicators and flags them with the "DATE" flag. |
Operates On: Lexical Items with TOKEN and possibly other flags as specified below, but not on ALL_PUNCTUATION.
Info |
---|
Currently handles the following situations: Month and Day only: Jan 25, January 25 Month, Day and Year: Jan 25 2019, January 25 2019 Month and Year only: Jan 2019, January 2019 YYYYMMDD Format: 20190125 MMDDYYYY Format: 01252019 Using separators too: 2019-01-25, 2019/1/25, 01/25/2019, 2019/01/2 Dates with Time: 2019-01-25T10:25 Dates with Time wihout separators: 20190125T102500 Using the 'Z' character at the end: 2019-01-25T10:25:10Z Using AM or PM for the time: 2019-01-25T10:25:10AM, 2019-01-25T10:25:10pm Using 24hr format: 2019-01-25T14:25:10Z The time only: 01:59:59PM, 01:59:59am
|
...
Include Page |
---|
| Generic Configuration Parameters |
---|
| Generic Configuration Parameters |
---|
|
Flags
Lex-Item Flags
- DATE_TIME - Identifies the token as a date or time indicator.
- SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
Example
Configuration Parameters
Parameter |
---|
summary | Day names separated by space |
---|
default | monday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri |
---|
name | dayNames |
---|
|
Defaults:
- monday
- tuesday
- wednesday
- thursday
- friday
| | |
Parameter |
---|
summary | Month names separated by space |
---|
name | monthNames |
---|
|
Defaults:
- january
- february
- march
- april
- may
- june
- july
- august
- september
- october
- november
- december
| - jan
- feb
- mar
- apr
- jun
- jul
- aug
- sep
- sept
- oct
- nov
- dec
| - jan.
- feb.
- mar.
- apr.
- jun.
- jul.
- aug.
- sep.
- sept.
- oct.
- nov.
- dec.
|
Parameter |
---|
summary | Ordinal number in text separated by space |
---|
name | ordinalWords |
---|
|
Defaults:
- first
- second
- third
- fourth
- fifth
- sixth
- seventh
- eighth
- ninth
- tenth
| - eleventh
- twelfth
- thirteenth
- fourteenth
- fifteenth
- sixteenth
- seventeenth
- eighteenth
- nineteenth
- twentieth
| - twenty-first
- twenty-second
- twenty-third
- twenty-fourth
- twenty-fifth
- twenty-sixth
- twenty-seventh
- twenty-eighth
- twenty-ninth
- thirtieth
- thirty-first
|
Parameter |
---|
summary | Month names separated by space |
---|
name | timeIndicators |
---|
|
Saga_config_stage |
---|
boundaryFlags | text block split |
---|
stage | DateTime |
---|
requiredFlags | token |
---|
|
"dayNames": "monday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri",
"monthNames": "january february march april may june july august september october november december jan feb mar apr jun jul aug sep sept oct nov dec jan. feb. mar. apr. jun. jul. aug. sep. sept. oct. nov. dec.",
"ordinalWords": "first second third fourth fifth sixth seventh eighth ninth tenth eleventh twelfth thirteenth fourteenth fifteenth sixteenth seventeenth eighteenth nineteenth twentieth twenty-first twenty-second twenty-third twenty-fourth twenty-fifth twenty-sixth twenty-seventh twenty-eighth twenty-ninth thirtieth thirty-first",
"timeIndicators": "am pm a.m p.m a.m. p.m." |
Example Output
Saga_graph |
---|
Code Block |
---|
language | text |
---|
theme | FadeToGrey |
---|
V-----[2016-06-06T15:30:30Z] started.-----V
^-[2016-06-06T15:30:30Z]-V---[started.]---^
^------[{datetime}]------^
Item [2016-06-06T15:30:30Z] - [ORIGINAL,HAS_DIGIT,TOKEN,HAS_PUNCTUATION]
Item [started.] - [ORIGINAL,TOKEN,HAS_PUNCTUATION]V----------------------------------------[On January 1, 2017 drilling started, then on February 1st of 2017 equipment broke down.]-----------------------------------------V
^-[On]-V--[January]--V---[1,]----V-[2017]-V-[drilling]-V---[started,]----V-[then]-V-[on]-V-[February]--V---[1st]----V-[of]-V-[2017]-V-[equipment]-V-[broke]-V---[down.]----^
^-[{_month_}]-^-[1]-V-[,]-^ ^-[started]-V-[,]-^ ^-[{_month_}]-^-[1]-V-[st]-^ ^-[down]-V-[.]-^
^----------[{_datetime_}]----------^ ^--------------[{_datetime_}]--------------^ |
Output Flags
Lex-Item Flags
- DATE_TIME - Identifies the token as a date or time indicator.
- SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
Info |
---|
In version 1.2.2 these flags were added: - TIME - Identidies the token as just time indicator
- HAS_PUNCTUATION - Tokens produced with at least one punctuation character are tagged as HAS_PUNCTUATION. (ALL_PUNCTUATION will not be tagged as HAS_PUNCTUATION).
- ALL_PUNCTUATION - Tokens processed or produced composed only of punctuation characters are tagged as ALL_PUNCTUATION.
|
Vertex Flags:
Info |
---|
No vertices are created in this stage |