How do Lucene / Sphinx / Solr work?

I have a website built with Phalcon and am trying to add a search engine to it. The content, however, is not in a database; it lives in flat files located in app/views/.

I have never used a search engine before, but from what I gather, Lucene, Solr, or Sphinx seems to be what I need.

Can these tools crawl my website the way HTTrack does, building an index along with the absolute URIs needed to link to each result?

How do I specify which parts of the HTML files should be parsed? How do these tools handle ignoring certain content (e.g. HTML markup, JS)?

1 answer

Lucene - , , . , , . , , . , , . Lucene , . , . , API, . "-" , . . .

[...] Solr is a server built on Lucene that exposes its API over HTTP. [...]
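As a rough illustration of the "which part of the HTML do I keep" step from the question, here is a minimal sketch that walks a views directory, drops `<script>` and `<style>` contents, and pairs the remaining text with an absolute URL. It uses only the Python standard library. The `.phtml` extension, the `base_url` value, and the assumption that a view's relative path maps directly to its URL are all hypothetical; a real Phalcon app may route differently, and template tags are not rendered here.

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Return the visible text of an HTML string as one space-joined line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_documents(views_dir, base_url):
    """Turn each view file into a dict ready to hand to a search engine.

    Assumption: the URL of a page is base_url plus the view's relative
    path without its extension (hypothetical routing).
    """
    root = Path(views_dir)
    docs = []
    for path in sorted(root.rglob("*.phtml")):
        rel = path.relative_to(root).with_suffix("").as_posix()
        docs.append({
            "id": rel,
            "url": f"{base_url.rstrip('/')}/{rel}",
            "text": extract_text(path.read_text(encoding="utf-8")),
        })
    return docs
```

Calling `build_documents("app/views", "https://example.com")` would yield one record per view file; those records could then be fed to whichever engine you pick. The point of the sketch is that selecting and cleaning the content is something you do yourself before indexing.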
