quartox closed this issue 7 years ago
Do you see something like this in the RM logs: Failing this attempt. Diagnostics: java.io.IOException: Mkdirs failed to create /tmp/hadoop-root/nm-local-dir/filecache/12_tmp/test.zip/../../../../../opt/conda/envs/test/etc/conda/deactivate.d
CondaCreator.zip_env uses relpath when writing. Perhaps any files outside of the directory tree should be ignored?
Also, in one test I just did, I get too-many-symbolic-links on a terminal-like file when zip does os.stat(filename). The zipping method should apparently have some safeguards.
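To make the two safeguards suggested above concrete, here is a minimal sketch of a zipping helper. It is not the actual CondaCreator.zip_env code: the function name zip_tree_safely and its return value are hypothetical, and it assumes a POSIX filesystem. It skips any entry that cannot be stat'ed (for example a symlink loop) and any entry whose real path lies outside the environment directory, so no archive name can contain "..".

```python
import os
import zipfile


def zip_tree_safely(root, zip_path):
    """Zip the tree under root; return the list of paths that were skipped.

    Hypothetical helper for illustration only, not part of knit.
    """
    root_real = os.path.realpath(root)
    skipped = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        # followlinks=False: symlinked directories are never descended into.
        for dirpath, _dirnames, filenames in os.walk(root_real, followlinks=False):
            for name in filenames:
                full = os.path.join(dirpath, name)
                try:
                    # Raises OSError for broken links and symlink loops.
                    os.stat(full)
                except OSError:
                    skipped.append(full)
                    continue
                # Ignore files that resolve to somewhere outside the tree.
                target = os.path.realpath(full)
                if os.path.commonpath([root_real, target]) != root_real:
                    skipped.append(full)
                    continue
                # dirpath always sits under root_real, so this never has "..".
                zf.write(full, os.path.relpath(full, root_real))
    return skipped
```

Returning the skipped paths rather than silently dropping them lets the caller log what was left out of the environment archive.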
Yes, the containers have no logs, but I see that diagnostic message:
Diagnostics: java.io.IOException: Mkdirs failed to create /hadoop06/yarn/nm/filecache/105_tmp/dask.zip/../../../../../../../conda-meta
Failing this attempt. Failing the application.
I pass an env with the path to my current environment to DaskYarnCluster, and I can see the zip file being built and uploaded, but then I get an error (below) saying that the JVM fails to report back. Passing the channel conda-forge starts up the environment correctly, but it then fails to load a parquet file from HDFS because it lacks hdfs3. My environment should be a superset of the other environment, with the same versions of dask and distributed, both from the conda-forge channel.