On the website I am building, I use Python-Markdown to format news posts. To avoid dead links and mixed-content problems (HTTP content on an HTTPS page), I require editors to upload all images to the site and then embed them (I use a tag editor, which I patched to make it easier to embed these images using standard Markdown syntax).
However, I would like to enforce the no-external-images policy in my code.
One way is to write a regular expression that extracts the image URLs from the Markdown source, or even to run the source through the Markdown renderer and use a DOM parser to extract the src attribute of every img tag.
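To make the second approach concrete, here is a minimal sketch of the post-render check using only the standard library. The names `ImageSrcCollector`, `find_external_images`, and the `example.com` allow-list are hypothetical placeholders, and treating every URL without a hostname as local is an assumption that may not match every policy.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class ImageSrcCollector(HTMLParser):
    """Collect the src attribute of every <img> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <img ... />.
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    self.sources.append(value)


def find_external_images(html, allowed_hosts=("example.com",)):
    """Return image URLs whose host is not in allowed_hosts.

    allowed_hosts is an assumed allow-list; replace it with the site's
    own host names.
    """
    collector = ImageSrcCollector()
    collector.feed(html)
    collector.close()

    external = []
    for src in collector.sources:
        host = urlparse(src).hostname
        # Assumption: URLs without a hostname (relative paths) are local.
        if host is not None and host not in allowed_hosts:
            external.append(src)
    return external
```

The `html` argument would be the renderer's output, for example the string returned by `markdown.markdown(source)`; the post could then be rejected whenever the returned list is non-empty.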
However, I am curious whether there is a way to hook into Python-Markdown to extract all image links, or to run custom code (e.g. raise an exception if the link is external), during parsing.