I am launching a Druid 0.9.0 on the Azure Cluster with an insight of HDP 2.4.1.1-3. The hadoop client is 2.7.1. After countless attempts to solve the problem with Jackson, in particular:
Error: class com.fasterxml.jackson.datatype.guava.deser.HostAndPortDeserializer overrides final method deserialize.
I tried all the workarounds described here.
https://github.com/druid-io/druid/blob/master/docs/content/operations/other-hadoop.md
to no avail.
Re-compilation failed after shading the Jackson dependency, as suggested by another user facing the same problem.
In the latter case, I tried to add
"mapreduce.job.user.classpath.first": "true"
for the jobProperties property of my indexing task with the following results:
Diagnostics: Exception from container-launch.
Container id: container_e02_1461544451524_0047_05_000001
Exit code: 1
Stack trace: ExitCodeException exitCode=1:
at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)
at org.apache.hadoop.util.Shell.run(Shell.java:487)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)
at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.launchContainer(LinuxContainerExecutor.java:371)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:303)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
Shell output: main : command provided 1
main : run as user is nobody
main : requested yarn user is druid
We are trying to start an indexing job.
Any help would be appreciated. Has anyone had success in the same setup?