Yes, in order to configure properly the component we recommend to use a normalize mime type
Right now multipage tiff file is not supported, so you need to split multipage tiff in advance of OCR process.
We recommend that only have 1 version installed, because sometimes the installations are not properly complete as you will see in the next example.
Sometimes you could encounter a NPE on the process of the OCR, but if you enable the debug you could found the real issue:
Error opening data file <path to Tesseract>\eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory.
Failed loading language 'eng' Tesseract couldn't load any languages! Could not initialize tesseract.
This could happens if you have multiple installations of the tesseract, but you can use two approaches to solve this:
Clean installation:
Set properly the TESSDATA_PREFIX environment variable: