Configuration
Element | Type | Default | Description |
---|
xPath | string |
|
| Specify xPath element in the AspireObject e.g., //category. |
xPath/@regex | string |
|
| The regular expression must be matched for the split field to be moved to the output. |
xPath/@output | string |
|
| The name of the output element that will be created in the document. |
Sample Configuration
<component name="regexSplitter" subType="regex" factoryName="aspire-splitter">
<xPath regex="^[A-Za-z ]*Keywords" output="pg_indexterm_classifications">/doc/pgClassifications_expanded</xPath>
<xPath regex="^[A-Za-z ]*Keywords" output="p_indexterm_classifications">/doc/pClassifications_expanded</xPath>
<xPath regex="^[A-Za-z ]*Keywords" output="pv_indexterm_classifications">/doc/pvClassifications_expanded</xPath>
<xPath regex="^[A-Za-z ]*Keywords" output="ma_indexterm_classifications">/doc/maClassifications_expanded</xPath>
</component>
Example
Input fields:
<maClassifications_expanded source="clasificationExpander">Scottish Keywords;Scottish Keywords/GAELIC LANGUAGE;Scottish Keywords/GAELIC LANGUAGE PROGRAMMES</maClassifications_expanded>
Output fields:
<ma_indexterm_classifications source="RegexSplitter" tagName="maClassifications_expanded">
<val>Scottish Keywords</val>
<val>Scottish Keywords/GAELIC LANGUAGE</val>
<val>Scottish Keywords/GAELIC LANGUAGE PROGRAMMES</val>
</ma_indexterm_classifications>