Working with Lucene

Could you suggest the steps to follow when working with Lucene, especially with big data (about 1 TB of PDF files to index)?

+6
java performance lucene
2 answers
  • Read Scaling Lucene and Solr.
  • Decide what you need from Lucene. For example, for the PDF files: do you need to store the full text, only make it searchable, or neither?
  • Run a small experiment: index a few documents and check whether the search results are good.
  • Try indexing the whole collection, applying the tips for fast indexing and for search speed. Is the search quality good enough? Is the performance sufficient?
  • Iterate.
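
To make the experiment step more concrete, here is a rough sketch of how the numbers from a small pilot run could be scaled up to the full 1 TB before committing to it. It does not use Lucene at all. The class name `PilotEstimate`, its methods, and the sample figures in `main` are all made up for illustration, and it assumes indexing time and index size grow roughly linearly with input size, which may not hold for your data or hardware.

```java
import java.time.Duration;

/** Hypothetical helper: scales measurements from a pilot indexing run up to the full corpus. */
public final class PilotEstimate {

    private PilotEstimate() {}

    /** Bytes of source data indexed per second during the pilot run. */
    public static double bytesPerSecond(long pilotBytes, Duration pilotTime) {
        if (pilotBytes <= 0 || pilotTime.isZero() || pilotTime.isNegative()) {
            throw new IllegalArgumentException("pilot must index some data and take some time");
        }
        return pilotBytes / (pilotTime.toNanos() / 1e9);
    }

    /** Linear extrapolation of the indexing time to the full corpus (an assumption, not a guarantee). */
    public static Duration estimateFullRun(long pilotBytes, Duration pilotTime, long corpusBytes) {
        double seconds = corpusBytes / bytesPerSecond(pilotBytes, pilotTime);
        return Duration.ofSeconds(Math.round(seconds));
    }

    /** Linear extrapolation of the on-disk index size to the full corpus. */
    public static long estimateIndexBytes(long pilotBytes, long pilotIndexBytes, long corpusBytes) {
        return Math.round((double) pilotIndexBytes / pilotBytes * corpusBytes);
    }

    public static void main(String[] args) {
        long gib = 1L << 30;
        // All four figures below are placeholders; substitute your own measurements.
        long pilotBytes = 10 * gib;                   // size of the PDF sample that was indexed
        Duration pilotTime = Duration.ofMinutes(20);  // wall-clock time the pilot took
        long pilotIndexBytes = 2 * gib;               // size of the resulting index directory
        long corpusBytes = 1024 * gib;                // roughly the 1 TB from the question

        Duration full = estimateFullRun(pilotBytes, pilotTime, corpusBytes);
        long indexBytes = estimateIndexBytes(pilotBytes, pilotIndexBytes, corpusBytes);
        System.out.printf("Estimated indexing time: %d h %d min%n",
                full.toHours(), full.toMinutes() % 60);
        System.out.printf("Estimated index size: %d GiB%n", indexBytes / gib);
    }
}
```

If the estimate comes out at days rather than hours, that is a signal to revisit what you store and how you index before running the full job, and then to measure again.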
+8

See Lucene Performance Optimization Tips. Because you are working with a large amount of data, you should also monitor indexing performance, not just search performance. The Lucene Wiki has tips on improving both indexing speed and search speed.

+5
