I would like to know whether there is any PDF library for .NET that can extract text from a region specified by coordinates.
For example (in pseudo-code):
PdfReader reader = new PdfReader();
reader.Load("file.pdf");
// Top, bottom, left, right in pixels or any other unit
string wholeText = reader.GetText(100, 150, 20, 50);
I tried to do this using PDFBox for .NET (the one that runs on top of IKVM), with no luck, and it seems very outdated and undocumented.
Maybe someone has a good solution using PDFBox, iTextSharp, or any other open source library and can give me a hint.
Thanks in advance.