Nutch is very different from what you have worked with so far. It is more of a framework: it has not only a front end for queries and search (although Solr seems more powerful than Nutch's own search interface), but also a crawler and an indexer (which writes to a Lucene index).
If you want to use the crawl data for purposes other than search, you will need to develop your own programs and be familiar with Hadoop and MapReduce programming.
I'm not sure what you want to do with a crawler, but it doesn't look like Nutch is the solution.
millebii