I am using DataStax Cassandra 3.1.4 with two nodes. I am running a Pig job with CqlStorage() against a table with 12 million rows, but I believe only one map task is used for a simple Pig command.
I tried changing split_size in the Pig load statement, but that didn't work.
Here is my sample query.
x = load'cql://Mykeyspace/MyCF?split_size=1000' using CqlStorage();
y = limit x 500;
dump y;
I did not find the input.split.size property in my mapred-site.xml. I assume the default split size is 64 * 1024.
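For reference, this is roughly how many input splits (and therefore map tasks) I expected. It is only a back-of-the-envelope sketch: it assumes the split size is a row count and that rows divide evenly into splits, which may not match how CqlStorage actually computes its splits. The function name expected_splits is made up for illustration.

```python
def expected_splits(total_rows: int, split_size: int) -> int:
    """Naive estimate: one split per `split_size` rows, rounded up."""
    if split_size <= 0:
        raise ValueError("split_size must be positive")
    # Integer ceiling division, avoids floating point rounding.
    return (total_rows + split_size - 1) // split_size


TOTAL_ROWS = 12_000_000          # rows in the table (from the question)
DEFAULT_SPLIT_SIZE = 64 * 1024   # assumed default split size, in rows

# Estimate with the assumed default, then with split_size=1000 from the load URL.
print(expected_splits(TOTAL_ROWS, DEFAULT_SPLIT_SIZE))
print(expected_splits(TOTAL_ROWS, 1000))
```

Under these assumptions both estimates come out well above one, which is why a single map task looks wrong to me.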
I tried set pig.splitCombination false;
Now it uses 513 map tasks regardless of the number of rows. I tried the same thing from Hive.
I connected to Cassandra from Hive and ran a simple select * query with where col1 > value. This table contains only 10 rows, but it still runs with 513 map tasks.
Please help me with this.
thanks