I have the Kafka Streaming base code that reads records from one topic, does some processing, and outputs records to another topic.
How does Kafka concurrency streaming work? Does everything work in one thread? I do not see this in the documentation.
If it is single-threaded, I would like multithreaded processing parameters to process large amounts of data.
If it is multithreaded, I need to understand how it works and how to process resources, for example, connections to SQL databases, should be shared in different processing threads.
Is the Kafka built-in streaming API not recommended for scenarios with large volumes compared to other parameters (Spark, Akka, Samza, Storm, etc.)?
apache-kafka apache-kafka-streams
clay
source share