Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

FAQs

Is needed any other component before the Tesseract OCR component?

Yes, in order to configure properly the component properly, we recommend to use a normalize mime type type.


Is

needed

any preprocessing needed before the Tesseract OCR component?

Right now multipage tiff file is not supported, so you need to split multipage tiff in advance of before the OCR process.

Can I have multiple Tesseract OCR versions installed?

We recommend that only have 1 version is installed, because since sometimes the installations are not completed properly complete , as you will see in the next example.

Troubleshooting

Problem

Sometimes you could encounter a You might get an NPE on the OCR process of the OCR, but and if you enable the debug you could found option, you'll find the real issuecause:

Error opening data file <path to Tesseract>/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory.
Failed loading language 'eng' Tesseract couldn't load any languages! Could not initialize tesseract.

Solution

This could happens happen if you have multiple Tesseract installations of the tesseract, but you can use two approaches to solve this or :

Either you don't have installed the proper language installed you are trying to use:

Panel
borderStylesolid
titleMethod 1 (Recommended)

Clean installation:

  • Uninstall all the tesseract programs that you have on your machine
  • Restart your machine
  • Install again the 5.0.2 version and verify you select have selected the English language or other languages language you want to use
  • Restart again
  • Verify that the TESSDATA_PREFIX is set properly to the tessdata folder in your tesseract installation

Or you might not have the TESSDATA_PREFIX variable correctly defined: 

Panel
borderStylesolid
titleMethod 2

Set properly the TESSDATA_PREFIX environment variable:

  • If your installation was completed properly complete, you should have a folder like this installed: (be sure verify you have the proper languages installed, in this case "eng"):

  • Set the variable to this folder.