"Algorithms of the intellectual network" Marmanisa and Babenko. Hands down the best links for what you need (based on your description). Yes, the code is written in Java, but it is so clean and simple that you can probably translate it to the one you need pretty quickly.
In terms of search functionality, it goes beyond indexing and describes, in detail, PageRank, user click improvements and a ranking algorithm based on PageRank for simple documents (ASCII, Word, PDF), which the author called DocRank.
The book also contains perhaps the best, practical coverage of recommendation mechanisms, for example. user-user, item-item, based on content.
HPC_4_ALL
source share