I have a Kafka cluster that receives messages from a source based on data changes in that source. In some cases, messages are intended to be processed in the future. Therefore, I have 2 options:
Option 1 is easier to execute, but my question is: is Kafka a reliable data warehouse? And has anyone done similar events with Kafka? Are there any holes in the design?
You can set the time that your messages remain in Kafka (log.retention.hours).
, Kafka " " , . , Kafka + Storm . - (MapReduce, Spark...) ?