Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Easy Heading Free
navigationTitleOn this Page
wrapNavigationTexttrue

Step 1. Launch Aspire and open the Workflows Page.

Launch Aspire (if it's not already running). See:

Step 2. Add or select a Workflow.

  • Add a new workflow or open an existing workflow.
  • For this step, please refer to the Workflow Introduction.

Step 3. Add a new Language Detector to the Workflow.

  • Select the event for which you want to add the Language DetectorSlide Extractor to, from the Event combo.
  • To add a Language DetectorSlide Extractor, drag the Language Detector Slide Extractor from the Rules Section on the right side of the screen and drop it below the Workflow Event to the left side of the screen. This will automatically open the Language Detector Slide Extractor window for the configuration of the publishercomponent.





Step 3a. Specify a description.

  In the Language Detector windowSlide Extractor window, specify the component information.

  1. General:
    1. Description: Enter the description of the component.

    2. Source Field: The source field for detection text.
    3. Detected Language Field: The field where the main detected language will be written.
    4. Detection Type: Type of detection to be used.
      1. All Languages: Include all supported languages.
      2. All Spoken Languages: Include only those languages not yet extinct.
      3. Cyrillic Languages: Include only those languages written with Cyrillic script.
      4. Specific Languages: Decide between the specific languages (Must select at least 2 languages)
      5. All Languages Except: Exclude specific Languages 
  2. Advanced:
      1. Language Detection Output: The field where all detected languages and its statistics will be written.
    1. Debug: Check to enable debug messages.
    2. Limit Source Field text size:  Limit the Source Field's text size to the first n characters.
      1. Size Limit: Maximum size for Source Field's text.
    3. Include ISO Codes: ISO codes will be included in the document. 
      1. Include ISO 639-1 Codes: Check to include the ISO 639-1 codes in the document.
      2. Include ISO 639-3 Codes: Check to include the ISO 639-3 codes in the document.
    1. Set Temp Files Path: The path to the folder where the temporal files were be processed. Also, you can set a System Variable "user.dir" with the path (if False the system variable is needed).

    2. Set Max Character to read: Maximum characters for the file loaded into memory to be parse. (False will set to unlimited).
    3. Add Extra Info from parent: Specify name of the field from the parent job to add as extra info to every slide.

    4. Parse process timeout: Maximum time (in minutes) for the processing thread to wait for the parsing result.

    5. Wait For SubJobs sleep: Time to wait (in milliseconds) until all sub jobs are done.

    6. Debug: Enable debug logs

Image Added


Image Added

Image Removed

Image Removed

Image Removed




Once you've clicked on the Add button, it will take a moment for Aspire to download all the necessary components (the Jar files) from the Maven repository and load them into Aspire. Once that's done, the component will appear in the Workflow Tree.

Info

For details on using the Workflow section, please refer to the Workflow introduction.

...