Fast inverted index in memory

Question

Fast inverted index in memory

I am looking for a live implementation of a built-in inverted index in memory. All I need is to store functions with weights for several million objects and use an inverted index to calculate the similarity between objects using different distance functions.

All other attributes of objects that I can store in some quick keystore.

I was hoping that I could use Lucene in the same way as an inverted index, but I can’t understand how I can link my own custom vector function with pre-computed weights to the document. Any recommendations would be much appreciated!

Thank.

+5

indexing lucene lucene.net ir

evgenyp Jul 07 '11 at 2:37

source share

4 answers

Grynn · Answer 1 · 2012-05-24T16:13:29+0000

, redis 'zset - , ( , ).

zset -.

, ,
feature → [{docid, score}, {docid, score}..]

zadd: docid

redis, , .. . zunionstore, zrange (http://redis.io/commands/zunionstore).

() .. ( redis db).

gladwig · Answer 2 · 2011-07-30T22:51:41+0000

Terrier? , , , Lucene.

Mike Sokolov · Answer 3 · 2011-09-17T16:01:48+0000

Lucene , . " ", , . , "" , - , Lucene , . .

benroth · Answer 4 · 2012-03-06T16:20:02+0000

, , , , , Lucene - . . .

, , Lucene, , .

org.apache.lucene.search.Similarity

setDefault(Similarity similarity)

(w.r.t. ), () , . , Lucene , ( " AND - OR-?" ), , . tf.idf .

, , LSH:

http://en.wikipedia.org/wiki/Locality-sensitive_hashing

Fast inverted index in memory

More articles: