I am trying to create a request for an answer icon (multiple choice question) python. The answer sheet will be scanned into an image file (gif, png, jpg, depending on the format).
My application has access to a database where all answers are stored.
Thus, all you need is some data from the scanned image so that it can compare the answer and calculate the labels.
The answer sheet is fixed in size with a tabular format like this (answers will be marked with an βXβ to indicate their answers):

After searching the Internet, I found that there are several OCR APIs.
The first is Pytesser . It is very easy to use, and the results are all right. But it only works for images with pure clear texts. So, I think this is not suitable.
The second found Ocropus . It seems powerful, but it has documentation
Window
OCRopus relies heavily on POSIX path names and file systems. You may be able to install OCRopus on Windows. An easier way is to install VirtualBox and run OCRopus on Ubuntu under VirtualBox.
So, I think this is mainly for Linux. I could not find a detailed installation guide for the window platform. (I'm new, so I might be wrong)
The third I found python-tesseract , a wrapper for Tesseract OCR . An installation guide was provided on the page. Basically, I need
- python-tesseract-win32.deb
- python opencv
- Numpy
but I do not know how to install .deb files in a window. I already have opencv and nampy .
So the following questions:
(1) How can I convert a table image into processed data (is this possible?)?
(2) Are there any other useful OCR APIs that I haven't mentioned here that might be useful?
(3) Finally, (my stupid idea) Is it possible to split the image into small cartridges (depending on the size of the table cells - since the table sizes are known) using PIL , and then use Pytesser to convert each small image to text, then process the data accordingly ?
FYI: I only need this for the Windows Platform, perhaps for Windows xp 32 bit. I am using python 2.7.5.