Elasticsearch plugin for document classification

Is there an elasticsearch plugin out there that will allow me to classify the documents that I entered into the index?

The best solution for me would be to classify all the most repeated terms (/ concepts) displayed as tag clouds that the user can move.

Is there any way to achieve this? Any suggestions?

thank

+3
source share
1 answer

The basic idea is to use termsaggregations , which will give one bucket per member.

POST /_search
{
    "aggs" : {
        "genres" : {
            "terms" : { "field" : "genre" }
        }
    }
}

The answer you get will be streamlined, reducing the number of cases:

{
    ...

    "aggregations" : {
        "genres" : {
            "doc_count_error_upper_bound": 0, 
            "sum_other_doc_count": 0, 
            "buckets" : [ 
                {
                    "key" : "jazz",
                    "doc_count" : 10
                },
                {
                    "key" : "rock",
                    "doc_count" : 5
                },
                {
                    "key" : "electronic",
                    "doc_count" : 2
                },
            ]
        }
    }
}

Kibana, tag cloud .

+3

All Articles