I have a java program running as a service, this program should insert 50 thousand rows / s (1 row of 25 columns) into the cassandra cluster.
My cluster contains 3 nodes, 1 node has 4 processor cores (2.4 GHz i5 core), 4 GB of RAM.
I used Hector api, a multi-threaded, bulk insert, but the performance is too low, as expected (about 25 thousand lines / s).
Someone has another solution for this. Cassandra supports the inner inner insert (without using Thrift).
source share