Apache Cassandra query / full-text search

I have been playing with the Apache Cassandra project. I have done a fair bit of reading and worked through some fairly complicated examples, including inserting single and batch datasets and retrieving one or more datasets by key. Some of the articles I have read include:

http://www.rackspacecloud.com/blog/2010/05/12/cassandra-by-example
http://github.com/digg/lazyboy
http://arin.me/blog/wtf-is-a-supercolumn-cassandra-data-model
http://www.sodeso.nl/?p=80

I have a fairly good understanding of the concepts and even have a simple application implemented.

However, none of the articles describe how to serve a query where, for example, the query is a search term entered by the user.

Does anyone know, or can anyone suggest, how I could fulfill such a request? Or perhaps a way to build a search index, full-text search, or anything even remotely close?

1 answer

You will probably want to split the text into words and use these words as the keys of your "index". Each word would hold an ordered column family with a list of the IDs of your articles, posts, etc. That way you can perform simple searches by key (word). When searching for more than one word, take the intersection of the ID lists stored under each word.
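
As a rough sketch of the idea, assuming plain Python dictionaries as a stand-in for the Cassandra rows (the names `index`, `tokenize`, `add_document`, and `search` are made up for illustration and are not part of any Cassandra client), it could look something like this:

```python
import re
from collections import defaultdict

# Hypothetical in-memory stand-in for a Cassandra column family:
# row key = word, columns = IDs of the documents containing that word.
index = defaultdict(set)

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def add_document(doc_id, text):
    """Record doc_id under every distinct word that appears in the text."""
    for word in set(tokenize(text)):
        index[word].add(doc_id)

def search(query):
    """Return the sorted IDs of documents containing every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    # Use .get so that looking up an unknown word does not create an empty row.
    id_sets = [index.get(word, set()) for word in words]
    return sorted(set.intersection(*id_sets))

add_document(1, "Cassandra data model explained")
add_document(2, "Full-text search with Cassandra")
add_document(3, "Search engines and data structures")

print(search("cassandra"))         # [1, 2]
print(search("cassandra search"))  # [2]
print(search("data"))              # [1, 3]
```

In a real setup each `index[word]` lookup would be a read of the row keyed by that word, and the intersection would be done on the client side.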

This is a very simple approach. If you need more complex queries, have a look at Lucandra (http://github.com/tjake/Lucandra), a full-text search engine that uses Cassandra as its storage backend.
