Trying to convert PDF to text with fonts

I am trying to read a PDF file and save the text in a text file. But at the same time, I also want to save font information at the same time. I know that PDFbox for text strings and fontbox for fonts can be combined?

+4
source share
1 answer

It should have been a comment, but since I got less reputation, I can’t comment, so I write here. I think you can run both pdfbox and fontbox as 2 streams (save time).

Save the data that you get to the Bean, and then use the Bean to get text and font information.

Your problem with getting a combined result can be solved.

+1
source

Source: https://habr.com/ru/post/1413402/


All Articles