Difference between hadoop jar and yarn jar

What's the difference between running a jar file with the hadoop jar and yarn jar commands?

I have successfully used the "hadoop jar" command on my Mac, but I want to make sure the job runs correctly and in parallel across my four cores.

Thanks!!!

1 answer

Short answer

They are probably identical for you, and even if they are not, either one should use your cluster to the best of its ability.


Longer answer

The /usr/bin/yarn script sets up the execution environment so that all of the yarn commands can be run. The /usr/bin/hadoop script is not as concerned with YARN-specific functionality. However, if your cluster is configured to use YARN as the default MapReduce implementation (MRv2), then hadoop jar will probably behave the same as yarn jar for a MapReduce job.
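One way to check which case applies is to look at the mapreduce.framework.name property in mapred-site.xml: a value of yarn means MapReduce jobs are submitted to YARN. Below is a minimal Python sketch of that check. The helper name framework_name and the sample XML are made up for illustration, and where mapred-site.xml lives depends on your installation.

```python
import xml.etree.ElementTree as ET


def framework_name(mapred_site_xml: str, default: str = "local") -> str:
    """Return mapreduce.framework.name from the text of a mapred-site.xml.

    Hypothetical helper, not part of Hadoop. Falls back to `default` when
    the property is absent; "local" is the default Hadoop itself uses.
    """
    root = ET.fromstring(mapred_site_xml)
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == "mapreduce.framework.name":
            return prop.findtext("value", default).strip()
    return default


# Illustrative configuration, not taken from a real cluster.
SAMPLE = """
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
</configuration>
"""

if __name__ == "__main__":
    print(framework_name(SAMPLE))
```

If this reports yarn, hadoop jar and yarn jar should both end up submitting the MapReduce job to YARN.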

In any case, you are probably fine, but you can always check the ResourceManager (or JobTracker) web UI to see how your job is distributed across the cluster, even if it is a single-node cluster.
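The same information is available from the ResourceManager's REST API, which can be handy for a quick scripted check. The sketch below assumes the ResourceManager web endpoint is at http://localhost:8088 (the usual default, but adjust for your setup). The function names are hypothetical, and the field names follow the Cluster Applications API as I understand it.

```python
import json
from urllib.request import Request, urlopen


def fetch_running_apps(rm_url: str = "http://localhost:8088") -> dict:
    """Fetch running applications from the ResourceManager REST API.

    rm_url is an assumption; point it at your own ResourceManager.
    """
    req = Request(
        f"{rm_url}/ws/v1/cluster/apps?states=RUNNING",
        headers={"Accept": "application/json"},
    )
    with urlopen(req) as resp:
        return json.load(resp)


def summarize_apps(payload: dict) -> list:
    """Reduce the API response to (id, state, containers, vcores) tuples."""
    # "apps" is null in the response when no application matches.
    apps = (payload.get("apps") or {}).get("app", [])
    return [
        (a.get("id"), a.get("state"),
         a.get("runningContainers"), a.get("allocatedVCores"))
        for a in apps
    ]


if __name__ == "__main__":
    for row in summarize_apps(fetch_running_apps()):
        print(row)
```

Seeing more than one running container for your job while it executes is a good sign that the work is being split up rather than run serially.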
