Closed karanveersingh5623 closed 2 years ago
My understanding is that you need to build your own docker image using this Dockerfile: https://github.com/mlcommons/hpc_results_v1.0/blob/master/NVIDIA/benchmarks/cosmoflow/implementations/mxnet/Dockerfile
Hi team
Refering to below git repo
(https://github.com/mlcommons/hpc_results_v1.0/tree/master/NVIDIA/benchmarks/cosmoflow/implementations/mxnet)
I tried adding mount options in SRUN cmd , below is the trace , the nvidia tensorflow image gives the dependencies error . Should I create my own image or do we have some public image which handles this scenario ?
srun --ntasks=1 --container-image=nvcr.io#nvidia/tensorflow:22.05-tf1-py3 --container-name=cosmoflow-preprocess --container-workdir=/mnt/ --container-mounts=/root/hpc_results_v1.0/NVIDIA/benchmarks/cosmoflow/implementations/mxnet:/mnt bash tools/init_datasets.sh /mnt/cosmoUniverse_2019_05_4parE_tf_small /mnt/processed