Elastic card reduces external banks

Question

Elastic card reduces external banks

Thus, it is easy enough to handle external jars when using hadoop straight up. You have the -libjars option that will do this for you. The question is how do you do this with EMR. There should be an easy way to do this. I thought the -cachefile CLI option would do this, but I just couldn't get it to work. Any ideas anybody?

Thanks for the help.

+8

jar hadoop amazon-emr

delmet Jun 14 '11 at 0:03

source share

3 answers

Judge mental · Answer 1 · 2012-07-13T03:45:50+0000

The best thing I had with external jar dependencies was to copy them (via the bootstrap action) to /home/hadoop/lib throughout the cluster. This path is on the class path of each node. This method is the only one that works regardless of where the code is located that accesses external banks (tool, task or task).

ajduff574 · Answer 2 · 2011-06-15T04:43:45+0000

One option is to take the first step in your task to configure the JAR, wherever they are. Or, if they are dependencies, you can pack them together with the application JAR (probably on S3).

user1015492 · Answer 3 · 2017-04-17T09:49:30+0000

FYI for newer versions of EMR / home / hadoop / lib is no longer used. / Usr / lib / hadoop-mapreduce.

Elastic card reduces external banks

More articles: