Which is the best scheduler for Hadoop: Oozie or cron?

Can anyone suggest which one is best for Hadoop? If it's Oozie, how does Oozie differ from cron jobs?

+4
4 answers

Oozie is the best option.

The Oozie Coordinator lets you trigger actions when files arrive in HDFS. This would be difficult to implement with cron or most other schedulers.
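
As a rough illustration of the file-arrival trigger, a coordinator can declare a dataset and wait for an instance of it before launching the workflow. The sketch below is an assumption about how this might look, not something taken from the linked post: the dataset name "logs", the /data/logs/... path and the inputDir property are hypothetical, and ${nameNode}, ${start}, ${end} and ${workflowAppUri} are expected to come from your job properties.

[xml]
<coordinator-app name="run-when-data-arrives"
                 frequency="${coord:days(1)}"
                 start="${start}" end="${end}" timezone="UTC"
                 xmlns="uri:oozie:coordinator:0.2">
  <datasets>
    <!-- Hypothetical daily dataset: one HDFS directory per day -->
    <dataset name="logs" frequency="${coord:days(1)}"
             initial-instance="${start}" timezone="UTC">
      <uri-template>${nameNode}/data/logs/${YEAR}/${MONTH}/${DAY}</uri-template>
      <!-- The instance counts as available once this flag file exists -->
      <done-flag>_SUCCESS</done-flag>
    </dataset>
  </datasets>
  <input-events>
    <!-- Wait for the dataset instance matching the current run -->
    <data-in name="input" dataset="logs">
      <instance>${coord:current(0)}</instance>
    </data-in>
  </input-events>
  <action>
    <workflow>
      <app-path>${workflowAppUri}</app-path>
      <configuration>
        <property>
          <!-- Hypothetical property handing the resolved input path to the workflow -->
          <name>inputDir</name>
          <value>${coord:dataIn('input')}</value>
        </property>
      </configuration>
    </workflow>
  </action>
</coordinator-app>
[/xml]

With a definition along these lines, the coordinator creates one action per day but holds it until the _SUCCESS flag shows up in that day's directory, so nothing has to poll HDFS from a cron script.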

Oozie gets callbacks from MapReduce jobs, so it knows when they finish and whether they have hung, without expensive polling. No other workflow manager can do this.

Oozie has other advantages over crontab and similar tools; the link below covers them:

https://prodlife.wordpress.com/2013/12/09/why-oozie/

+1

Oozie , , - , . Oozie . Oozie . Oozie .

cron hadoop - , - , , . , oozie, , cron.

oozie Java ( ) . Java, oozie .

Cron - , /.

0

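With Oozie, a scheduled job is described by a workflow.xml and a coordinator.xml. The coordinator accepts a cron-like frequency. For example, the following one runs at 2 a.m. on weekdays.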

[xml]
<coordinator-app name="weekdays-at-two-am"
                 frequency="0 2 * * 2-6"
                 start="${start}" end="${end}" timezone="UTC"
                 xmlns="uri:oozie:coordinator:0.2">
  <!-- Cron-like frequency: 02:00, Monday through Friday (Oozie day-of-week 2-6) -->
  <action>
    <workflow>
      <app-path>${workflowAppUri}</app-path>
      <configuration>
        <property>
          <name>jobTracker</name>
          <value>${jobTracker}</value>
        </property>
        <property>
          <name>nameNode</name>
          <value>${nameNode}</value>
        </property>
        <property>
          <name>queueName</name>
          <value>${queueName}</value>
        </property>
      </configuration>
    </workflow>
  </action>
</coordinator-app>
[/xml]

The coordinator-app above uses the cron-like frequency syntax in Oozie. Note that it is only "cron-like": the day of week is 1-7 (1 is Sunday), not 0-6 as in cron.

Reference: http://hortonworks.com/blog/new-in-hdp-2-more-powerful-scheduling-options-in-oozie/
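
One way to sidestep the day-of-week numbering difference, as far as I know, is to use day names, which Oozie's cron-like syntax also accepts (SUN-SAT). A minimal sketch, reusing the same placeholder properties as the example above; the coordinator name is made up for illustration:

[xml]
<!-- Same schedule as "0 2 * * 2-6" in Oozie; a plain crontab would write 0 2 * * 1-5 -->
<coordinator-app name="weekdays-at-two-am-named"
                 frequency="0 2 * * MON-FRI"
                 start="${start}" end="${end}" timezone="UTC"
                 xmlns="uri:oozie:coordinator:0.2">
  <action>
    <workflow>
      <app-path>${workflowAppUri}</app-path>
    </workflow>
  </action>
</coordinator-app>
[/xml]

Named days read the same in both tools, so a schedule copied from a crontab is less likely to shift by a day.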

0

Apache oozie hdfs.

, , , , oozie. Oozie

I think Oozie is the best option.

Of course, you can use cron, but it takes a lot of effort to make it work with Hadoop.

0