kubeflow / arena

A CLI for Kubeflow.
Apache License 2.0
733 stars 177 forks source link

Update model manage #1063

Closed ChenYi015 closed 5 months ago

ChenYi015 commented 6 months ago

Update model description

When submitting a traing job, the full submit command will be used as model version description. But function getFullSubmitCoomand does not work well when comes up with array flags.

Check whether tracking server is healthy

When creating a MLflow client, check whether the tracking server status is healthy.

Support MLflow client proxied by Kubernetes API server

If environemnt variable MLFLOW_TRACKING_URI is specified, then creates a non-proxied MLflow client, otherwise, a MLflow client proxied by kubernetes api server will be created. This proxied client will search services named with ack-mlflow or mlflow in all namespaces. If no such mlflow service found, an error will be throwed, and if multiple mlflow services found, use the first service.

Code Simplicfication

When get and print a training job or serving job, there is some code with the same logic, and it would be better to extract a function.