I think this is very difficult to automate. The main problem is that you need very good OCR software. And even if you have it, what happens if it reads something incorrectly because of someone's messy handwriting? If the ID is wrong, the payment is booked to the wrong employee, and if the amount is wrong, the employee receives the wrong salary!
Both are things you really don't want. To see how hard it is to find good OCR, just look at how CAPTCHAs work: in principle, a CAPTCHA is nothing but an image of hard-to-read text.
So my opinion is that you cannot fully automate this process. What you can do is write a program that helps a person enter the values manually (also take a look at Amazon Mechanical Turk):
- Show the image with the handwritten characters on the right. If the characters are always in the same place or specially marked (with a frame around them, etc.), try to find these places automatically and show them to the user.
- On the left, offer two text fields in which the user can enter values.
To get a fast and smooth process, you have to make sure the user can enter values easily using only the keyboard:
- When showing a new image, set the focus to the ID text box.
- If the ID always has a fixed length, switch to the next field as soon as all digits are entered.
- (If you enable this, pressing Backspace in the empty next field should jump back to the previous one.)
- Otherwise, let the user move to the next text box by pressing Tab or Return.
- Usually these text fields are placed one above the other (not side by side), so you should also support switching between them with the up and down arrow keys.
- When the entry in the last text box is completed, automatically show the next image.
- Also, while a new record is still empty (nothing entered yet), make it easy to return to the previous record using the Backspace key or the left arrow key.
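The keyboard rules above can be sketched independently of any GUI toolkit as a small state machine. This is only a minimal sketch under assumptions: the class `EntryForm`, the key names (`"Tab"`, `"Return"`, `"Backspace"`, `"Up"`, `"Down"`, `"Left"`) and the fixed ID length of 6 are all hypothetical, and a real program would wire `press` to the key events of whatever toolkit it uses.

```python
from dataclasses import dataclass, field

FIELDS = ("id", "amount")  # entry order: ID box on top, amount box below
ID_LENGTH = 6              # assumed fixed ID length (hypothetical value)


def _empty() -> dict:
    return {name: "" for name in FIELDS}


@dataclass
class EntryForm:
    """Keyboard-only navigation over one record (image) at a time."""
    record: int = 0                            # index of the image being shown
    focus: int = 0                             # index into FIELDS
    values: dict = field(default_factory=_empty)
    saved: dict = field(default_factory=dict)  # record index -> entered values

    def press(self, key: str) -> None:
        name = FIELDS[self.focus]
        fresh = not any(self.values.values())  # nothing entered yet
        if key == "Backspace":
            if self.values[name]:
                self.values[name] = self.values[name][:-1]
            elif self.focus > 0:
                self.focus -= 1                # empty field: back to previous field
            elif fresh and self.record > 0:
                self._show(self.record - 1)    # empty record: back to previous record
        elif key == "Left":
            if fresh and self.record > 0:
                self._show(self.record - 1)
        elif key in ("Tab", "Return"):
            self._advance()
        elif key == "Down":
            self.focus = min(len(FIELDS) - 1, self.focus + 1)
        elif key == "Up":
            self.focus = max(0, self.focus - 1)
        elif len(key) == 1:                    # a printable character
            self.values[name] += key
            if name == "id" and len(self.values[name]) == ID_LENGTH:
                self._advance()                # ID complete: jump to next field

    def _advance(self) -> None:
        if self.focus < len(FIELDS) - 1:
            self.focus += 1
        else:                                  # last field done: store, next image
            self.saved[self.record] = dict(self.values)
            self._show(self.record + 1)

    def _show(self, record: int) -> None:
        """Switch to another record, restoring earlier input if there is any."""
        self.record = record
        self.values = dict(self.saved.get(record, _empty()))
        self.focus = 0                         # new image: focus the ID box
```

For example, typing the six digits of an ID moves the focus to the amount box without any extra key press, and pressing Return there stores the record and moves on to the next image.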
Using this process, one person can enter many records into your database, and the cost is much lower than that of hunting down wrong records in your database later.
One last remark:
Because this is a boring task for a person and can easily lead to errors, you could have two people enter each value and accept it as approved only if both entered the same thing. This should bring the accuracy to somewhere above 99%. If you absolutely need 100%, think about letting 4-5 people check each record, and accept it only if they all entered the same values. To also get an idea of how good your OCR software is, run it over your images as well and compare its results with the human-entered values. That tells you when you can really rely on OCR alone.
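The double-entry check and the OCR comparison could look roughly like the following. This is a sketch under assumptions: the function names `approve` and `ocr_agreement`, and the representation of a record as an `(id, amount)` pair of strings keyed by image name, are made up for illustration.

```python
from typing import Dict, Optional, Sequence, Tuple

Record = Tuple[str, str]  # (employee ID, amount), both kept as entered text


def approve(entries: Sequence[Record], required: int = 2) -> Optional[Record]:
    """Return the record if at least `required` people entered it and all
    entries are identical (ignoring surrounding whitespace), else None."""
    if len(entries) < required:
        return None
    distinct = {(emp_id.strip(), amount.strip()) for emp_id, amount in entries}
    return next(iter(distinct)) if len(distinct) == 1 else None


def ocr_agreement(ocr: Dict[str, Record], approved: Dict[str, Record]) -> float:
    """Fraction of human-approved records that the OCR output matches exactly."""
    if not approved:
        return 0.0
    hits = sum(1 for image, record in approved.items() if ocr.get(image) == record)
    return hits / len(approved)
```

Records for which `approve` returns `None` would go back into the queue for another round of manual entry. Setting `required` to 4 or 5 corresponds to the stricter variant with more checkers.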
Oliver