Table of Contents |
---|
The Tabular Files Extractor gets a tabular file (a comma/tab separated file) from the content stream in the job and converts each column to a separate subJob.
In the following example suppose that there's a file called "file:test.txt" which contains the following:
Code Block | ||||
---|---|---|---|---|
| ||||
first second third data1 data2 data3 |
Once executing the Tabular SubJob Extractor, each subJob will contain a row of the original document, which in this case, is only one row:
Code Block | ||||||
---|---|---|---|---|---|---|
| ||||||
<doc> <parent><fetchUrl>./testdata/com.accenture.aspire.components/testdata/testcommaseparated.csv</fetchUrl></parent> <subDocId>test.txt-0</subDocId> <extension source="TabularSubJobExtractor"> <field name="first">data1</field> <field name="second">data2</field> <field name="third">data3</field> </extension> </doc> |