kubeflow / arena

A CLI for Kubeflow.
Apache License 2.0

failed to validate command args: unsupported successPolicy #1169

Closed by ChenYi015 1 month ago

ChenYi015 commented 1 month ago

What happened?

Submitting a TensorFlow training job fails with an "unsupported successPolicy" error, even though no success policy was passed on the command line:

$ arena submit tf \
    --name=tf-git \
    --working-dir=/root \
    --gpus=1 \
    --image=kube-ai-registry.cn-shanghai.cr.aliyuncs.com/kube-ai/tensorflow:1.5.0-devel-gpu \
    --sync-mode=git \
    --sync-source=https://code.aliyun.com/xiaozhou/tensorflow-sample-code.git \
    --data=nas-pvc:/mnt \
    --tensorboard \
    --logdir=/mnt/tf_data/logs \
    "python code/tensorflow-sample-code/tfjob/docker/mnist/main.py --log_dir /mnt/tf_data/logs  --data_dir /mnt/tf_data/"
ERRO[0000] failed to validate command args: unsupported successPolicy

What did you expect to happen?

The job should be submitted successfully when no success policy is specified.
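The expected behavior can be sketched roughly as follows. This is a minimal Python illustration only, not arena's actual validation code (arena is written in Go). The function name `validate_success_policy` and the set of accepted values are hypothetical assumptions made for the example. The point is that an unset (empty) policy should fall through to the default instead of being rejected.

```python
# Hypothetical sketch of the expected validation behavior.
# ASSUMED_POLICIES and validate_success_policy are illustrative names,
# not taken from the arena source code.
ASSUMED_POLICIES = {"ChiefWorker", "AllWorkers"}


def validate_success_policy(policy: str = "") -> None:
    """Raise ValueError only for a non-empty, unrecognized policy."""
    if policy == "":
        # Flag was not set: accept it and let the default apply.
        return
    if policy not in ASSUMED_POLICIES:
        raise ValueError("unsupported successPolicy")


# An unset policy, as in the command above, should pass validation.
validate_success_policy()
```

Under this sketch, only an explicitly supplied value outside the accepted set would produce the error reported above.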

Environment

Arena version:

$ arena version
arena: v0.10.0
  BuildDate: 2024-10-14T02:30:12Z
  GitCommit: 62ed8b98a8c2edd6196e3b885d07b525e24e0c2d
  GitTreeState: clean
  GitTag: v0.10.0
  GoVersion: go1.23.1
  Compiler: gc
  Platform: darwin/arm64

Impacted by this bug?

Give it a 👍. We prioritize the issues with the most 👍.