I need to execute MapReduce in my Cassandra cluster, including data locality, i.e. each job only asks for lines that belong to the local Casandra Node where the job runs.
There are tutorials on how to configure Hadoop for MR on an earlier version of Cassandra (0.7). I can not find one for the current version.
What has changed from 0.7 in this regard?
What software modules are needed for minimal configuration (Hadoop + HDFS + ...)?
Do I need Cassandra Enterprise?
cassandra mapreduce hadoop
Maciej miklas
source share