It can also handle complete PDFs, and seems to go at about 15 pages per minute on a modern desktop PC. Once gImageReader is set up and the Frankur toggle/icon is switched, even when taking a screenshot the OCR results were pretty good… Because I didn’t need to set any file-paths to it, in gImageReader. I’m assuming that gImageReader ‘knows’ where Tesseract 4.0 is, and hooks into it automatically. That needs to be launched in Windows Administrator mode, and then it also seems to require a Fraktur download, in order to handle OCR of German blackletter letterforms… On their forums I found a gImageReader beta version that is newly-compiled for Tesseract 4.0 beta. gImageReader is 64-bit Windows and current.
Once installed there are a few Windows GUI front-ends to choose from, with which to operate Tesseract. Tesseract 4.0 supports OCR in a range of old and ancient letterforms including German blackletter (aka Fraktur, in popular parlance ‘Gothic’), but these need to selectively enabled at install… The Tesseract engine was apparently originally from Google, in use there at Google Books, but Google made it open source.
That’s why we have built a Tesseract installer for Windows.” Normally we run Tesseract on Debian GNU Linux, but there was also the need for a Windows version. “The Mannheim University Library uses Tesseract to perform OCR of historical German newspapers.
FREE OCR FONT FREE
The free open-source Tesseract OCR 4.0 for Windows (beta, 64-bit), released 14th April 2018.