Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Identifies tokens

...

that look like numbers and flags

...

the tokens with the "NUMBER" flag.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below.

Saga_is_recognizer

Info

Currently handles the following situations:

  • Integers:  0, 1, 100
  • Floats:  0.5, 12.2, 3.14159 (.)
  • Negative:  -1003, -12.2
  • Thousands with/without separator: 1000000, 1,000,000 (,)
  • Numbers with scientific notation: 1

...

  • .1x10^-8, 1.1x10

...

  • -8, 1.1x10^8, -1.1x10

...

  • 8, 1.1E±11, -1.1E±11
  • Romans

...

  • numerals: MMC, XII, IV
  • English

...

  • ordinal numbers: 1st, 12th, 23rd
  • Exponents:

...

  •   4^9, 4^-9, 49,

...

  • 4-9
  • Fractions: 4/9, 131/13
Warning
titleWarning

Exponent recognition is not working at the moment.

Note

Does NOT currently recognize:

  • Computer literals:  0xBEA1, 07832
  • European numbers with commas and periods swapped
  • Positive:  +102.3

Operates On:  Lexical Items with TOKEN

Configuration Parameters

    • skipFlags (string array, optional) - Flags to be skipped by this stage
      • Tokens marked with this flags will be ignore by this stage, and no process will be performed.
    • requiredFlags (string array, optional)
      • Tokens need to have all the specified flags, in order to be processed
    • debug (boolean, optional)
      • Enable all debug log functionality of the stage, if any.
    • scientificNotation (boolean, optional)
      • Enable recognition of scientific notation.
    • ordinals (boolean, optional)
      • Enable ordinals numbers recognition.
    • romans (boolean, optional)
      • Enable romans number recognition.
    • ordinalsLang (string, optional)
      • Choose ordinals language. Currently supported english: "en"

...

languagejs
themeEclipse
titleExample Configuration

...

Include Page
Generic Configuration Parameters
Generic Configuration Parameters

Configuration Parameters

  • Parameter
    summaryEnable recognition of scientific notation
    defaultfalse
    namescientificNotation
    typeboolean
  • Parameter
    summaryEnable ordinal number recognition
    defaultfalse
    nameordinals
    typeboolean
  • Parameter
    summaryEnable Roman numeral recognition
    defaultfalse
    nameromans
    typeboolean
  • Parameter
    summaryChoose ordinal language
    defaulten
    nameordinalsLang
    • . Currently supports
      • English: "en"
      • Spanish: "es"
Code Block
languagejs
"

...

scientificNotation": false,

...

"ordinals": true,

...

"romans": false,

...

"ordinalsLang": "en"

...

Flags

Lex-Item Flags:

  • NUMBER - Flagged on all tokens which are numbers according to the rules above.

Vertex Flags:

...

Example Output

Code Block
languagetext

...

...

V----------[1984 42 -10 3.14 ]-----------V  
^--[1984]--V--[42]--V--[-10]--V--[3.14]--^  

...

^----[#]---V---[#]--V---[#]---V----[#]---^

Output Flags

Lex-Item Flags:

  • NUMBER - Flagged on all tokens that are numbers according to the rules above.
  • SEMANTIC_TAG - Identifies all lexical items that are semantic tags.

Vertex Flags:

Info

No vertices are created in this stage