I want to use an html parser that makes this a beautiful, elegant way
- Extract text (this is most important)
- Extract links, meta keywords
- Restore original document (optional, but nice feature)
From my research so far, jericho seems to fit. Any other open source libraries that you guys would recommend?
user308808
source
share