Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Identifies tokens that look like dates or time indicators and flags them with the "DATE" flag.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below, but not on ALL_PUNCTUATION.

Saga_is_recognizer

Info

Currently handles the following situations:

  • Month and Day only: Jan 25, January 25

  • Month, Day and Year: Jan 25 2019, January 25 2019

  • Month and Year only: Jan 2019, January 2019

  • YYYYMMDD Format: 20190125

  • MMDDYYYY Format: 01252019

  • Using separators too: 2019-01-25, 2019/1/25, 01/25/2019, 2019/01/2

  • Dates with Time: 2019-01-25T10:25

  • Dates with Time wihout separators: 20190125T102500

  • Using the 'Z' character at the end: 2019-01-25T10:25:10Z

  • Using AM or PM for the time: 2019-01-25T10:25:10AM, 2019-01-25T10:25:10pm

  • Using 24hr format: 2019-01-25T14:25:10Z

  • The time only: 01:59:59PM, 01:59:59am

...

Include Page
Generic Configuration Parameters
Generic Configuration Parameters

Flags

Lex-Item Flags

  • DATE_TIME - Identifies the token as a date or time indicator.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.

Example

Configuration Parameters

  • Parameter
    summaryDay names separated by space
    defaultmonday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri
    namedayNames
    • Defaults:

      • monday
      • tuesday
      • wednesday
      • thursday
      • friday
      • saturday
      • sunday
      • mon
      • tue
      • tu
      • tues
      • thu
      • th
      • thur
      • thurs
      • fri
  • Parameter
    summaryMonth names separated by space
    namemonthNames
    • Defaults:

      • january
      • february
      • march
      • april
      • may
      • june
      • july
      • august
      • september
      • october
      • november
      • december
      • jan
      • feb
      • mar
      • apr
      • jun
      • jul
      • aug
      • sep
      • sept
      • oct
      • nov
      • dec
      • jan.
      • feb.
      • mar.
      • apr.
      • jun.
      • jul.
      • aug.
      • sep.
      • sept.
      • oct.
      • nov.
      • dec.
  • Parameter
    summaryOrdinal number in text separated by space
    nameordinalWords
    • Defaults:

      • first
      • second
      • third
      • fourth
      • fifth
      • sixth
      • seventh
      • eighth
      • ninth
      • tenth
      • eleventh
      • twelfth
      • thirteenth
      • fourteenth
      • fifteenth
      • sixteenth
      • seventeenth
      • eighteenth
      • nineteenth
      • twentieth
      • twenty-first
      • twenty-second
      • twenty-third
      • twenty-fourth
      • twenty-fifth
      • twenty-sixth
      • twenty-seventh
      • twenty-eighth
      • twenty-ninth
      • thirtieth
      • thirty-first
  • Parameter
    summaryMonth names separated by space
    nametimeIndicators
    • Defaults:

      • am
      • pm
      • a.m
      • p.m"
      • a.m.
      • p.m.
Saga_config_stage
boundaryFlagstext block split
stageDateTime
requiredFlagstoken
"dayNames": "monday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri",
"monthNames": "january february march april may june july august september october november december jan feb mar apr jun jul aug sep sept oct nov dec jan. feb. mar. apr. jun. jul. aug. sep. sept. oct. nov. dec.",
"ordinalWords": "first second third fourth fifth sixth seventh eighth ninth tenth eleventh twelfth thirteenth fourteenth fifteenth sixteenth seventeenth eighteenth nineteenth twentieth twenty-first twenty-second twenty-third twenty-fourth twenty-fifth twenty-sixth twenty-seventh twenty-eighth twenty-ninth thirtieth thirty-first",
"timeIndicators": "am pm a.m p.m a.m. p.m."

Example Output

FadeToGrey
Saga_graph
Code Block
languagetext
theme
V-----[2016-06-06T15:30:30Z] started.-----V 
^-[2016-06-06T15:30:30Z]-V---[started.]---^ 
^------[{datetime}]------^ 

Item [2016-06-06T15:30:30Z] - [ORIGINAL,HAS_DIGIT,TOKEN,HAS_PUNCTUATION]
Item [started.] - [ORIGINAL,TOKEN,HAS_PUNCTUATION]V----------------------------------------[On January 1, 2017 drilling started, then on February 1st of 2017 equipment broke down.]-----------------------------------------V 
^-[On]-V--[January]--V---[1,]----V-[2017]-V-[drilling]-V---[started,]----V-[then]-V-[on]-V-[February]--V---[1st]----V-[of]-V-[2017]-V-[equipment]-V-[broke]-V---[down.]----^ 
       ^-[{_month_}]-^-[1]-V-[,]-^                     ^-[started]-V-[,]-^               ^-[{_month_}]-^-[1]-V-[st]-^                                       ^-[down]-V-[.]-^ 
       ^----------[{_datetime_}]----------^                                              ^--------------[{_datetime_}]--------------^

Output Flags

Lex-Item Flags

  • DATE_TIME - Identifies the token as a date or time indicator.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
Info

In version 1.2.2 these flags were added:

  • TIME - Identidies the token as just time indicator
  • HAS_PUNCTUATION - Tokens produced with at least one punctuation character are tagged as HAS_PUNCTUATION. (ALL_PUNCTUATION will not be tagged as HAS_PUNCTUATION).
  • ALL_PUNCTUATION - Tokens processed or produced composed only of punctuation characters are tagged as ALL_PUNCTUATION.


Vertex Flags:

Info

No vertices are created in this stage