You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »


The Tesseract OCR application will perform an OCR operation to a image using the open source tool known as Tesseract

Features


Some of the features of the Tesseract OCR application include:

  • Multiple language support.
  • Multiple page segmentation modes.
  • Multiple image creation color scales and formats.


Limitations 


Since Tesseract is a third party tool that needs to be set up separately from Aspire,  the Tesseract OCR application has the following limitations as per API:

  • Tesseract must be installed separately.
  • Before using a Tesseract feature, it must be properly installed.
    • OCR for other languages as French, Spanish among others.
  • While performing OCR in a file, the order where the languages are provided will affect the output.
  • Multi-page tiff file is not supported right now.

Future Development Plan  


  • Add multi language support
  • Add multipage tiff support

Anything we should add? Please let us know.

  • No labels