You may be interested in lxml. It is a separate package and has C components, but it is the fastest option. It also has a very good API that makes it easy to list the links or forms in an HTML document, sanitize HTML, and more. It can also parse malformed HTML, and its parsing behavior is configurable.
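As a rough illustration of listing links and forms, here is a minimal sketch. It assumes lxml is installed (for example via `pip install lxml`); the markup, URLs, and field name are made up for the example, and the exact tree that the parser recovers from broken markup may differ from what you would expect.

```python
from lxml import html

# Deliberately malformed markup (hypothetical sample): the first <a> and both
# <p> tags are never closed, and </body> and </html> are missing.
BROKEN_HTML = """
<html><body>
<p>See <a href="/docs">the docs
<p>or <a href="https://example.com/faq">the FAQ</a>
<form action="/search" method="get"><input type="text" name="q" value="lxml"></form>
"""

# The parser recovers from the broken markup instead of raising an error.
doc = html.fromstring(BROKEN_HTML)

# iterlinks() yields (element, attribute, link, pos) for every link-bearing
# attribute it finds, which includes the form's action as well as each href.
links = [link for _element, _attribute, link, _pos in doc.iterlinks()]

# doc.forms lists the <form> elements; fields maps input names to values.
forms = [(form.action, form.method, dict(form.fields)) for form in doc.forms]

print(links)
print(forms)
```

If only anchors matter, `doc.xpath('//a/@href')` returns just the `href` values.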
Paweł Hajdan, Sep 17 '08 at 11:19