What if you clear html with cURL, run php through each img element and read in the binary data of the image file and replace the urr attribute of the src image with the base64 encoding value of the open image file, for example:
'<img src="data:image/jpg;base64,'. base64_encode($imagedata) . '"/>'
if the base64 image data is hard-coded on the page, than this will give you a programmatic way to check that all the images are βloadedβ and prevent the problem of converting PDFs, starting with downloading all the images ...
Webchemist
source share