Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Identifies tokens

...

that look like numbers and flags

...

the tokens with the "NUMBER" flag.


Operates On:  Lexical Items with TOKEN and possibly other flags as specified below.

Saga_is_recognizer

Info

Currently handles the following situations:

  • Integers:  0, 1, 100
  • Floats:  0.5, 12.2, 3.14159 (.)
  • Negative:  -1003, -12.2
  • Thousands with/without separator: 1000000, 1,000,000 (,)
  • Numbers with scientific notation: 1

...

  • .1x10^-8, 1.1x10

...

  • -8, 1.1x10^8, -1.1x10

...

  • 8, 1.1E±11, -1.1E±11
  • Romans

...

  • numerals: MMC, XII, IV
  • English

...

  • ordinal numbers: 1st, 12th, 23rd
  • Exponents:

...

  •   4^9, 4^-9, 49,

...

  • 4-9
  • Fractions: 4/9, 131/13
Warning
titleWarning

Exponent recognition is not working at the moment.

Note

Does NOT currently recognize:

  • Computer literals:  0xBEA1, 07832
  • European numbers with commas and periods swapped
  • Positive:  +102.3

Include Page
Generic Configuration Parameters
Generic Configuration Parameters

Configuration Parameters

...

  • Parameter
    summaryEnable recognition of scientific notation

...

  • Enable ordinals numbers recognition.

...

  • Enable romans number recognition.

...

  • Choose ordinals language. Currently supported english: "en"
  • defaultfalse
    namescientificNotation
    typeboolean
  • Parameter
    summaryEnable ordinal number recognition
    defaultfalse
    nameordinals
    typeboolean
  • Parameter
    summaryEnable Roman numeral recognition
    defaultfalse
    nameromans
    typeboolean
  • Parameter
    summaryChoose ordinal language
    defaulten
    nameordinalsLang
    • . Currently supports
      • English: "en"
      • Spanish: "es"
Code Block
languagejs

...

...

"scientificNotation": false,

...

"ordinals": true,

...

"romans": false,

...

"ordinalsLang": "en"

...

Flags

...

  • LEX_ITEM - Identifies then token as lexical item.
  • NUMBER - Flagged on all tokens which are numbers according to the rules above.
  • SEMANTIC_TAG - Identifies all lexical items which are semantic tags.

...

Example Output

Code Block
languagetext

...

...

V----------[1984 42 -10 3.14 ]-----------V  
^--[1984]--V--[42]--V--[-10]--V--[3.14]--^  
^----[#]---V---[#]--V---[#]---V----[#]---^

...

Output Flags

Lex-Item Flags:

  • NUMBER - Flagged on all tokens that are numbers according to the rules above.
  • SEMANTIC_TAG - Identifies all lexical items that are semantic tags.

Vertex Flags:

Info

No vertices are created in this stage