Web page reverse engineer

I want to rebuild any web page into a logical representation of the page. For example, if a web page has a menu, then I want the logical menu structure to possibly be in XML. If there is an article on the webpage, I want an XML node article, if it has a title for the article, I need an XML node header. Basically, I want the logical form of a web page without any user interface.

This logical model can be objects in code or XML, it does not matter, the important part is that it determines what everything on the page means.

+5
source share
2 answers

It looks like you want the person to classify the contents of the page.

, .

, , ul , - ?

?

+3

All Articles