Background
The idea is this:
- A person provides contact information for the purchase of online books.
- The book, as a PDF, is marked with a unique hash
- The person receives the book
- PDF passwords are easy to get around or share
An ideal process would be something like this:
- Create a hash based on contact information
- Save contact information and hash in database
- Acquire the book lock
- Update the hash file with the hash text
- Create the PDF book (using pdflatex)
- Apply the hash to the book
- Release the book lock
- Email the book
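To make the steps above concrete, here is a minimal sketch in Java of the hash, lock, and build steps. Everything named in it is an assumption made for illustration: the class `BookStamper`, the methods `hashFor` and `stamp`, the files `book.lock`, `hash.tex`, and `book.tex`, the `\bookhash` macro, and the choice of SHA-256. The database step and the actual embedding of the hash in the PDF are left out, since the latter is what this question asks about.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardOpenOption;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper; all names here are illustrative only.
class BookStamper {

    /** Returns the hex-encoded SHA-256 digest of the contact information. */
    public static String hashFor(String contactInfo) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(contactInfo.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
    }

    /**
     * Holds an exclusive file lock while the hash file is rewritten and the
     * book is rebuilt, so that concurrent purchases (separate processes)
     * cannot interleave their builds.
     */
    public static void stamp(Path workDir, String hash)
            throws IOException, InterruptedException {
        Path lockFile = workDir.resolve("book.lock");
        try (FileChannel channel = FileChannel.open(lockFile,
                 StandardOpenOption.CREATE, StandardOpenOption.WRITE);
             FileLock lock = channel.lock()) {           // acquire the book lock
            // Update the hash file; book.tex is assumed to \input it.
            String macro = "\\newcommand{\\bookhash}{" + hash + "}\n";
            Files.write(workDir.resolve("hash.tex"),
                        macro.getBytes(StandardCharsets.UTF_8));
            // Rebuild the PDF only if the (assumed) LaTeX source is present.
            if (Files.exists(workDir.resolve("book.tex"))) {
                Process build = new ProcessBuilder(
                        "pdflatex", "-interaction=nonstopmode", "book.tex")
                        .directory(workDir.toFile())
                        .inheritIO()
                        .start();
                build.waitFor();
            }
        }                                                // lock is released here
    }

    public static void main(String[] args) throws Exception {
        String hash = hashFor("Jane Doe <jane@example.com>");
        stamp(Paths.get("."), hash);
        System.out.println(hash);
    }
}
```

Note that `FileChannel.lock()` coordinates separate processes; threads inside one JVM would need their own synchronization, because overlapping locks in the same JVM throw `OverlappingFileLockException`.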
Technologies
You can use the following technologies (other programming languages are possible, but libraries are likely to be limited to those provided by the host):
- C, Java, PHP
- LaTeX Files
- PDF files
- Linux
Question
What programming techniques (or open-source software) should I explore to:
- Embed a unique hash (or other mark) in a PDF
- Create a collusion-resistant mark
- Develop a non-fragile solution (e.g., PDF -> EPS -> PDF still contains the mark)
Research
I reviewed the following options:
- steganography
- Natural Language Processing (NLP)
- Convert blank pages in the PDF to images; mark those images; reassemble the PDF
- LaTeX watermark package
- ImageMagick
Problems
The possible solutions I investigated have the following problems:
- Steganography. (a) Requires a master copy of the images, which must then be converted to EPS, an intensive and time-consuming step; (b) it is unclear whether the watermark would survive PDF -> EPS -> PDF or other kinds of conversion; (c) most of the images are illustrations or screenshots, not photographs.
- LaTeX. Creates an image cache; any steganographic solution would have to intercept this process somehow.
- NLP. Introduces grammatical errors; may change the meaning of technical terms.
- Blank pages. Immediately suspicious; blank pages are easily replaced.
- Watermark package. Draws visible marks.
- ImageMagick. Draws visible marks.
What other solutions are possible?
Thanks!
pdflatex watermark steganography
Dave Jarvis