You can implement this quite easily with any number of open source python projects:
- The mechanism seems pretty popular
- Beautiful soup and urlib
You crawl the site using one of these methods and check the server response, which should be fairly simple.
However, if you have a site map (or any list with all your URLs), you can just try and open each of them with cURL or urllib and get an answer without having to scan.
Julio source share