I am looking for a tool / database / solution that can help me with real-time aggregation of logs and can query them also in real time.
The main requirement is the ability to deliver results as soon as possible, bearing in mind that there may be many events for the query (possibly billions), but there will be many “columns” in the logs, and each query will set some conditions for these columns, so the final result will be some kind of aggregation, or only a small subset of rows will be returned.
Now I was watching HDFS + HBase, which seems like a good solution. Are there any alternatives? Can you recommend something?
source share