This stage identifies tokens that look like phone numbers and flags them as "PHONE".


Operates On:  Lexical Items with TOKEN or SEMANTIC_TAG and possibly other flags as specified below.

Saga_is_recognizer

Note

Currently, the stage handles only U.S. phone number formats.

Info

These are the supported formats:

  • 7-digit numbers: 555-4444
  • Area codes: 333-222-1111, also with parentheses: (333)222-1111
  • Country code: 1-222-333-4444 and +1-222-333-4444
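
The supported formats can be approximated with a single regular expression. The following is a minimal sketch in Python, not the stage's actual implementation: the names `US_PHONE` and `looks_like_phone` are hypothetical, and the pattern covers only the formats listed above.

```python
import re

# Hypothetical pattern covering only the formats listed on this page;
# the stage's real matching logic is not documented here.
US_PHONE = re.compile(
    r"""
    (?:\+?1-)?              # optional country code: 1- or +1-
    (?:\(\d{3}\)|\d{3}-)?   # optional area code: (333) or 333-
    \d{3}-\d{4}             # 7-digit local number: 555-4444
    """,
    re.VERBOSE,
)

def looks_like_phone(text: str) -> bool:
    """Return True if the whole string matches one of the listed formats."""
    return US_PHONE.fullmatch(text) is not None
```

Because the pattern is applied with `fullmatch`, a token is accepted only when the entire token is a phone number, so longer digit sequences that merely contain a valid number are rejected.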


Generic Configuration Parameters

...

Saga_config_stage
Default Config
boundaryFlags: text block split
oneFlag: token, semantic tag
skipFlags: skip

Example Output

Description

Saga_graph
V------------------------[please call 1-800-555-5555 thank you]------------------------V 
^-[please]-V-[call]-V----------------[1-800-555-5555]----------------V-[thank]-V-[you]-^ 
                    ^-[1]-V-[-]-V-[800]-V-[-]-V-[555]-V-[-]-V-[5555]-^                   
                    ^-------------------[{phone}]--------------------^
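
The graph above shows the whitespace token `1-800-555-5555` gaining a parallel `{phone}` item while the original tokens stay in place. A rough Python sketch of that behavior follows. It is an illustration only: `PHONE` and `add_phone_tags` are hypothetical names, and a plain whitespace split stands in for the real Saga graph.

```python
import re

# Hypothetical pattern for the U.S. formats this page lists; not the stage's own code.
PHONE = re.compile(r"(?:\+?1-)?(?:\(\d{3}\)|\d{3}-)?\d{3}-\d{4}")

def add_phone_tags(text):
    """Return (token_index, token, tag) for each whitespace token that looks like a phone number."""
    return [
        (i, tok, "{phone}")
        for i, tok in enumerate(text.split())
        if PHONE.fullmatch(tok)
    ]
```

Calling `add_phone_tags("please call 1-800-555-5555 thank you")` yields one entry, for the token at index 2, which corresponds to the single `{phone}` item in the graph.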

...

  • SEMANTIC_TAG - Identifies all lexical items that are semantic tags.
  • PHONE - Identifies the token as a phone number.

Vertex Flags:

Info

No vertices are created in this stage.