Closed ChenYi015 closed 1 month ago
Failed to submit tensorflow training job due to unsupported success policy:
$ arena submit tf \ --name=tf-git \ --working-dir=/root \ --gpus=1 \ --image=kube-ai-registry.cn-shanghai.cr.aliyuncs.com/kube-ai/tensorflow:1.5.0-devel-gpu \ --sync-mode=git \ --sync-source=https://code.aliyun.com/xiaozhou/tensorflow-sample-code.git \ --data=nas-pvc:/mnt \ --tensorboard \ --logdir=/mnt/tf_data/logs \ "python code/tensorflow-sample-code/tfjob/docker/mnist/main.py --log_dir /mnt/tf_data/logs --data_dir /mnt/tf_data/" ERRO[0000] failed to validate command args: unsupported successPolicy
Job should be submitted successfully when success policy is not specified.
Arena version:
$ arena version arena: v0.10.0 BuildDate: 2024-10-14T02:30:12Z GitCommit: 62ed8b98a8c2edd6196e3b885d07b525e24e0c2d GitTreeState: clean GitTag: v0.10.0 GoVersion: go1.23.1 Compiler: gc Platform: darwin/arm64
Give it a 👍 We prioritize the issues with most 👍
What happened?
Failed to submit tensorflow training job due to unsupported success policy:
What did you expect to happen?
Job should be submitted successfully when success policy is not specified.
Environment
Arena version:
Impacted by this bug?
Give it a 👍 We prioritize the issues with most 👍