I am trying to run a simple job with Oozie.
It consists of a single Pig action.
I have a file: FirstScript.pig containing:
dual = LOAD 'default.dual' USING org.apache.hcatalog.pig.HCatLoader();
store dual into 'dummy_file.txt' using PigStorage();
and a workflow.xml file containing:
<workflow-app name="FirstWorkFlow" xmlns="uri:oozie:workflow:0.2">
    <start to="FirstJob"/>
    <action name="FirstJob">
        <pig>
            <job-tracker>hadoop:50300</job-tracker>
            <name-node>hdfs://hadoop:8020</name-node>
            <script>/FirstScript.pig</script>
        </pig>
        <ok to="okjob"/>
        <error to="errorjob"/>
    </action>
    <!-- Failure path: a kill node ends the workflow and reports the failed node's error -->
    <kill name="errorjob">
        <message>job error, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>
    <!-- Success path: an end node takes no message -->
    <end name="okjob"/>
</workflow-app>
I created a structure:
FirstScript
|- lib
|---FirstScript.pig
|- workflow.xml
What now? How do I deploy and run this with Oozie?
Can anyone more experienced help?
Yours faithfully
Pawel
hadoop oozie apache-pig
Pawel kowalski