There is Kapow. It's pure Java and costs money:
http://kapowtech.com/
And there is Lixto: it's based on Eclipse and uses Mozilla Gecko as its rendering engine (unless they have already switched to WebKit, as they said they would many years ago). It's very nice and also costs money:
http://www.lixto.com/?page_id=50
Both are graphical tools in which you define the site navigation and what you want to extract by pointing and clicking. But you can also write XPath and regular expressions by hand, and even JavaScript that runs in the context of the sites.
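To give an idea of what the hand-written XPath and regular-expression part looks like, here is a minimal sketch in plain Python. It is not the Kapow or Lixto API; the page fragment, the class names, and the `extract_products` helper are all made up for illustration, and it assumes well-formed markup (real pages usually need a proper HTML parser).

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment used only for this example.
PAGE = """
<html><body>
  <div class="product"><h2>Widget</h2><span class="price">EUR 12.50</span></div>
  <div class="product"><h2>Gadget</h2><span class="price">EUR 7.00</span></div>
</body></html>
"""

def extract_products(page):
    """Return (name, price) tuples found in the page."""
    root = ET.fromstring(page)
    products = []
    # XPath (the subset ElementTree supports) selects the repeating records.
    for node in root.findall(".//div[@class='product']"):
        name = node.findtext("h2")
        price_text = node.findtext("span[@class='price']") or ""
        # A regular expression pulls the numeric part out of the selected text.
        match = re.search(r"(\d+\.\d+)", price_text)
        if match is None:
            continue  # skip records without a recognizable price
        products.append((name, float(match.group(1))))
    return products

print(extract_products(PAGE))
```

The split is typical of such tools: the XPath picks out the nodes, and the regular expression cleans up the text inside them.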
I used both of them in lectures on web data extraction at the Vienna University of Technology (Lixto was written by the professor who gave the lecture).
panzi