The Tabular Files Extractor gets a tabular file (a comma/tab separated file) from the content stream in the job and converts each column to a separate subJob.

Example


In the following example suppose that there's a file called "file:test.txt" which contains the following:


File Content
first   second  third
data1   data2   data3


Once executing the Tabular SubJob Extractor, each subJob will contain a row of the original document, which in this case, is only one row:


Result
<doc>
   <parent><fetchUrl>./testdata/com.accenture.aspire.components/testdata/testcommaseparated.csv</fetchUrl></parent>
   <subDocId>test.txt-0</subDocId>
   <extension source="TabularSubJobExtractor">
      <field name="first">data1</field>
      <field name="second">data2</field>
      <field name="third">data3</field>
   </extension>
 </doc>



  • No labels