Does Mapreduce apply replication to intermediate data?

At Mapreduce, we say that the output generated by cartographers is called intermediate data.

Are intermediate data repeated?

Temporary intermediate data?

When are intermediate data deleted? Is it deleted automatically or do we need to explicitly delete it?

+4
source share
1 answer

Missed Mapper files are stored in the local file system of the working node where Mapper is running. Similarly, data transferred from one node to another node is stored in the local file system of the working node where the task is performed.

hadoop.tmp.dir '/tmp'.

, , , - , .

+6

All Articles