I need to parse (server side) a large number of HTML pages.
We all agree that regexp is not the way to go here. It seems to me that JavaScript is the native way to parse an HTML page, but that assumption relies on the server-side code having all the DOM capabilities that JavaScript has inside a browser.
Does Node.js have that ability built in? Is there a better approach to this problem of parsing HTML on the server side?
Itay Moav -Malimovka Sep 10 '11 at 16:18