Excerpt |
---|
This stage identifies tokens that looks like URL addresses and flag it them as "URL". |
Operates On: Lexical Items with TOKEN and possibly other flags as specified below.
...
Info |
---|
Currently handles the following situations: - HTTP and HTTPS protocols: http://mydomain.com, https://mydomain.com
- Domains: www.my_domain.com, http://www.blog.my_2nd_domain.com
- IP address (with protocol) e.g. : http://235.156.13.10/
- Ports: http://mydomain.com:9890
- Paths: http://mydomain.com/path/to/dir.html
- Parameters and anchors, query strings: http://mydomain.com/path/to/dir.html?var1=true+or+false&setFlag, http://mydomain.com/path/to/dir.html#section2
- Encoding: http://mydomain.com/path%20to%20dir
|
Include Page |
---|
| Generic Configuration Parameters |
---|
| Generic Configuration Parameters |
---|
|
Configuration Parameters
Parameter |
---|
summary | Enable recognition for domains. (e.g www.mydomain.com) |
---|
default | false |
---|
name | domain |
---|
type | boolean |
---|
|
Parameter |
---|
summary | Enable recognition for urls. (e.g http://mydomain.com) |
---|
default | false |
---|
name | url |
---|
type | boolean |
---|
|
Code Blocksaga_config_stage |
---|
boundaryFlags | text block split |
---|
requiredFlags | token |
---|
language | skipFlags | skipjs |
---|
|
"domain": false,
"url": true |
Example Output
Code Block |
---|
|
saga_graph |
V----------------------------[All the answers in http://www.notaproblem.com.]-----------------------------V
^--------------------------[All the answers in http://www.notaproblem.com]---------------------------V-[]-^
^-[All]-V-[the]-V-[answers]-V-[in]-V------------------[http://www.notaproblem.com]-------------------^
^-[all]-^ ^-[http]-V-[:]-V-[//]-V-[www]-V-[.]-V-[notaproblem]-V-[.]-V-[com]-^
^----------------------------[{URL}]------------------------------^ |
Output Flags
Lex-Item Flags:
- SEMANTIC_TAG - Identifies all lexical items which are semantic tags.
- URL - Identifies the token as an URL address
...