Another possible way is to include Pig in Python or JavaScript. You can do something like this (in Python):
import os from org.apache.pig.scripting import Pig P = Pig.compile("PUT YOUR PIG CODE HERE") hdfs_input = "YOUR HDFS INPUT" hdfs_output = "YOUR HDFS OUTPUT" local_output = "YOUR LOCAL OUTPUT" result = P.bind({'in': input, 'out': hdfs_output}).runSingle() os.system("hadoop fs -getmerge " + hdfs_output + " " + local_output)
and run Python code (for example)
pig -useHCatalog python_code.py
source share