Have you indexed crawl results using elasticsearch before?

Has anyone managed to write custom indexes for nutch to index crawl results using elasticsearch? Or do you know what already exists?

+8
full-text-search web-crawler elasticsearch lucene nutch
source share
4 answers

Did not do this, but it is definitely doable, but you would need to combine the SOLR code (src / java / org / apache / nutch / indexer / solr) and adapt it to ElasticSearch. Will be a good contribution to Nutch BTW

+2
source share

I wrote an ElasticSearch plugin that mocks the Solr api. Using this plugin and the standard Nutch Solr indexer, you can easily send workarounds to ElasticSearch. The plugin and an example of using it with Nutch can be found on GitHub:

https://github.com/mattweber/elasticsearch-mocksolrplugin

+10
source share

I know that Nutch will add plug-in backends and am glad to see it. I needed to integrate elasticsearch with Nutch 1.3. The code is posted here. Unloaded the code (src / java / org / apache / nutch / indexer / solr).

https://github.com/ctjmorgan/nutch-elasticsearch-indexer

+3
source share

As time goes on, Nucth is now well integrated with ElasticSearch. Here is a good tutorial.

0
source share

All Articles