I am considering using Tesseract to work with PDF files, so I want to use a library, not an external executable.
I started by downloading the full Tesseract source and looked at building it. Unfortunately, the standard sources do not come with build tooling for any platform other than Linux, and in my case that means Windows. There are ways around this, and I looked at them.
Firstly, the VS2008 build does not work. I know that it needs Leptonica, but I decided I would get to that later and just try to build the existing code. It fails with "fatal error C1083: Cannot open include file: 'allheaders.h': No such file or directory". Nothing to do with Leptonica at this point, as far as I can tell; it just doesn't work.
Even if I could get past this, I would have to build Leptonica, and that requires using the GNU tools and therefore installing Cygwin, so I gave up. I have a MinGW installation (I never managed to get Cygwin to work in a convenient way), but I'm not keen to mess with such a complex and fragile build.
So I decided to just use the prebuilt binaries that some kind soul provides, and downloaded them from code.google.com. Now I need to learn how to code against the library, so the next obvious step is the Tesseract API example, which claims to require "tesseract-ocr-3.02.02-win32-lib-include-dirs.zip". No problem, because I already have that now.
There is no real indication of where the API example wants the files to be located, but a bit of messing about gets them into the right places. Click "Build" and: "fatal error C1083: Cannot open include file: 'allheaders.h': No such file or directory", exactly as when trying to build Tesseract from source.
And indeed, there is no such file anywhere in what I downloaded.
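For anyone who wants to reproduce the check, here is a minimal sketch of how the extracted archives can be searched for the header. The function name `find_header` and the directory passed to it are illustrative only; nothing here is part of Tesseract or Leptonica, and the path in the usage comment is a placeholder for wherever the zip files were unpacked.

```python
import os


def find_header(root, name="allheaders.h"):
    """Return every path under `root` whose file name equals `name`.

    The comparison ignores case, since Windows file systems usually do.
    """
    matches = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            if filename.lower() == name.lower():
                matches.append(os.path.join(dirpath, filename))
    return matches


# Hypothetical usage; replace the path with the real extraction directory:
# print(find_header(r"C:\path\to\extracted\archives"))
```

An empty list means the header is not anywhere under that directory, so no include-path setting pointing inside it could satisfy the `#include`.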
So where is this file?