I am trying to find performance benefits for reusing scripts to write and create new objects in the wordcount mapreduce conversion program. However, the two versions take almost the same time to fill in all the big input.
I also tried to give the task less heap space by changing
<property> <name>mapred.child.java.opts</name> <value>-Xmx120m</value> </property>
But both versions worked a little slower compared to the higher heap. I could never get a program that reuses recordings for recording in order to work better. Did I miss something?
The part of wordcount that I modified is
public void map(Object key, Text value, Context context ) throws IOException, InterruptedException { StringTokenizer itr = new StringTokenizer(value.toString()); while (itr.hasMoreTokens()) { context.write(new Text(itr.nextToken()), new IntWritable(1)); } }
source share