It all depends on how they included the data in the PDF. Generally speaking, there are two possible scenarios here:
Data is just a text object in PDF format. You will need to use a tool to extract text from a PDF, and then paste it into your database.
The data is contained in the form fields in PDF format. You will need to use the tool to extract data from the form fields and insert them into your database.
Hopefully scenario # 2 applies to you, because that's exactly what PDF forms are for. Scenario # 1 is just a hack that you would only use if you had no other options. Extracting plain text from a PDF file is not as simple or accurate as you might expect.
If you get a PDF form, all you have to do is combine the correct fields in the PDF form with the corresponding fields in your database and then suck in the data. This process can be fully automated if you wrote your own application.
Does it require writing an application or is there an automated way to do this?
Yes, both of these options will require writing an application or purchasing an application. If you are writing your own application, you need to find a third-party PDF library that supports extracting data from form fields or extracting text from PDF.
Rowan
source share