Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Identifies tokens which that look like dates or time indicator indicators and flags them with the "DATE" flag.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below, but not on ALL_PUNCTUATION.

Saga_is_recognizer

Info

Currently handles the following situations:

  • Month and Day only: Jan 25, January 25

  • Year and Month only: 2019 Jan, 2019 January
  • Month , Day and Year only: Jan 25 2019, January 2019, January 25 of 2019, Jan of '19
  • Day Month Year: 26 April 2019, 26th of April, 2019
  • Month and Day Year only: Jan 2019, January . 22, 2001, January the 22nd of 2001
  • Day Name, Day Month Year: Friday, 26 Apr 19, Friday the 26th of April 2019
  • YYYYMMDD Format: 20190125
  • MMDDYYYY Format: 01252019
  • Using separators too: 2019-01-25, 2019/1/25, 01/25/2019, 2019/01/2
  • Dates with Time: 2019-01-25T10:25
  • Dates with Time wihout separators: 20190125T102500
  • Using the 'Z' character at the end: 2019-01-25T10:25:10Z
  • Using AM or PM for the time: 2019-01-25T10:25:10AM, 2019-01-25T10:25:10pm
  • Using 24hr format: 2019-01-25T14:25:10Z
  • The time only: 01:59:59PM, 01:59:59am


Include Page
Generic Configuration Parameters
Generic Configuration Parameters

Flags

Lex-Item Flags:

  • NUMBER - Flagged on all tokens which are numbers according to the rules above.
  • DATE_TIME - Identifies the token as a date or time indicator.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.

Example

Configuration Parameters

  • Parameter
    summaryDay names separated by space
    defaultmonday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri
    namedayNames
    • Defaults:

      • monday
      • tuesday
      • wednesday
      • thursday
      • friday
      • saturday
      • sunday
      • mon
      • tue
      • tu
      • tues
      • thu
      • th
      • thur
      • thurs
      • fri
  • Parameter
    summaryMonth names separated by space
    namemonthNames
    • Defaults:

      • january
      • february
      • march
      • april
      • may
      • june
      • july
      • august
      • september
      • october
      • november
      • december
      • jan
      • feb
      • mar
      • apr
      • jun
      • jul
      • aug
      • sep
      • sept
      • oct
      • nov
      • dec
      • jan.
      • feb.
      • mar.
      • apr.
      • jun.
      • jul.
      • aug.
      • sep.
      • sept.
      • oct.
      • nov.
      • dec.
  • Parameter
    summaryOrdinal number in text separated by space
    nameordinalWords
    • Defaults:

      • first
      • second
      • third
      • fourth
      • fifth
      • sixth
      • seventh
      • eighth
      • ninth
      • tenth
      • eleventh
      • twelfth
      • thirteenth
      • fourteenth
      • fifteenth
      • sixteenth
      • seventeenth
      • eighteenth
      • nineteenth
      • twentieth
      • twenty-first
      • twenty-second
      • twenty-third
      • twenty-fourth
      • twenty-fifth
      • twenty-sixth
      • twenty-seventh
      • twenty-eighth
      • twenty-ninth
      • thirtieth
      • thirty-first
  • Parameter
    summaryRecognize dates. (Completely independent from Time, can be used alone or with the Recognize Time flag checked))
    defaultTrue
    namerecognizeDate
    typeboolean
  • Parameter
    summaryRecognize time. (Completely independent from Date, can be used alone or with the Recognize Date flag checked))
    defaultTrue
    namerecognizeTime
    typeboolean
  • Parameter
    summaryMonth names separated by space
    nametimeIndicators
    • Defaults:

      • am
      • pm
      • a.m
      • p.m"
      • a.m.
      • p.m.
Code Block
boundaryFlagstext block split
stageDateTime
requiredFlagstoken
languagejs
"dayNames": "monday tuesday wednesday thursday friday saturday sunday mon tue tu tues thu th thur thurs fri",
"monthNames": "january february march april may june july august september october november december jan feb mar apr jun jul aug sep sept oct nov dec jan. feb. mar. apr. jun. jul. aug. sep. sept. oct. nov. dec.",
"ordinalWords": "first second third fourth fifth sixth seventh eighth ninth tenth eleventh twelfth thirteenth fourteenth fifteenth sixteenth seventeenth eighteenth nineteenth twentieth twenty-first twenty-second twenty-third twenty-fourth twenty-fifth twenty-sixth twenty-seventh twenty-eighth twenty-ninth thirtieth thirty-first",
"recognizeDate": true,
"recognizeTime": true,
"timeIndicators": "am pm a.m p.m a.m. p.m."

Example Output

Code Block
languagetext
Code Block
languagetext
themeFadeToGrey
V-----[2016-06-06T15:30:30Z] started.-----V 
^-[2016-06-06T15:30:30Z]-V---[started.]---^ 
^------[{datetime}]------^ 

Item [2016-06-06T15:30:30Z] - [ORIGINAL,HAS_DIGIT,TOKEN,HAS_PUNCTUATION]
Item [started.] - [ORIGINAL,TOKEN,HAS_PUNCTUATION]V--------------------------------------[On January 1, 2017 drilling started, on February 1st of 2017 equipment broke down.]---------------------------------------V 
^-[On]-V--[January]--V---[1,]----V-[2017]-V-[drilling]-V---[started,]----V-[on]-V-[February]--V---[1st]----V-[of]-V-[2017]-V-[equipment]-V-[broke]-V---[down.]----^ 
       ^-[{_month_}]-^-[1]-V-[,]-^                     ^-[started]-V-[,]-^      ^-[{_month_}]-^-[1]-V-[st]-^                                       ^-[down]-V-[.]-^ 
       ^----------[{_datetime_}]----------^                                     ^--------------[{_datetime_}]--------------^

Output Flags

Lex-Item Flags

  • DATE_TIME - Identifies the token as a date or time indicator.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
  • TIME - Identifies the token as just time indicator
  • HAS_PUNCTUATION - Tokens produced with at least one punctuation character are tagged as HAS_PUNCTUATION. (ALL_PUNCTUATION will not be tagged as HAS_PUNCTUATION).
  • ALL_PUNCTUATION - Tokens processed or produced composed only of punctuation characters are tagged as ALL_PUNCTUATION.


Vertex Flags:

Info

No vertices are created in this stage