For illustration, assume this is a forum service. I need to calculate the "similarity" among each user's own posts, so the result would look something like this:
Among posts by user A: similarity 60%
Among posts by user B: similarity 20%
...
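To make the goal concrete, here is a minimal sketch of one way such a per-user score could be defined. It is only an assumption for illustration, not what Solr or MoreLikeThis computes: each pair of posts is compared by the Jaccard overlap of their character bigrams (which works on multibyte text because Python strings are Unicode), and the user's score is the mean over all pairs. The names `bigrams`, `jaccard`, and `user_similarity` are hypothetical.

```python
from itertools import combinations


def bigrams(text):
    # Set of character bigrams; operates on Unicode code points,
    # so multibyte text needs no special handling.
    return {text[i:i + 2] for i in range(len(text) - 1)}


def jaccard(a, b):
    # Jaccard overlap of the two bigram sets, in the range [0.0, 1.0].
    x, y = bigrams(a), bigrams(b)
    if not x and not y:
        return 0.0
    return len(x & y) / len(x | y)


def user_similarity(posts):
    # Mean pairwise similarity among one user's posts (assumed definition).
    pairs = list(combinations(posts, 2))
    if not pairs:
        return 0.0  # fewer than two posts: nothing to compare
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)
```

Multiplying the value returned by `user_similarity` by 100 gives a percentage like the ones above. Comparing every pair is quadratic in the number of posts per user, which is part of why I would prefer to let the search engine do the work.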
I deal with multibyte strings, so I guess I'm stuck with search engines. We already use Solr and have already implemented MoreLikeThis, but I'm not quite sure how to build the request for this. Any help appreciated!
jodeci