When files are transferred to nodes using the distributed cache mechanism in the Hadoop streaming task, does the system delete these files after the task is completed? If they are deleted, what can I assume if there is a way to cache several jobs? Does it work the same on Amazon Elastic Mapreduce?
amazon-web-services elastic-map-reduce hadoop
Jd long
source share