From the magazine:
The file system schema is βfileβ. This indicates that the specified Hadoop configuration path is incorrect and the system uses the default Hadoop configuration values. The Flink YARN client needs to store its files in a distributed file system.
Flink could not read the Hadoop configuration files. They are either selected from environment variables, for example. HADOOP_HOME, or you can set the configuration directory to flink-conf.yaml before executing the YARN command.
Flink needs to read the Hadoop configuration to find out how to load the flank jar into the cluster file system so that the newly created YARN cluster can access it. If Flink cannot resolve the Hadoop configuration, it uses the local file system to load the jar. This means that the jar will be placed on the machine from which you start your cluster. Therefore, it will not be accessible from the Flink YARN cluster.
See the Flink Configuration Page page for more information.
edit: In Amazong EMR, export HADOOP_CONF_DIR=/etc/hadoop/conf , let Flink find the Hadoop configuration directory.
mxm
source share