xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.
The duplicate definition of the JOBSET_NAME environment variable causes failures with newer versions of kueue, with an error like this:
The JobSet "xyz" is invalid: spec.replicatedJobs[0].template.spec.template.spec.containers[1].env[10]: Duplicate value: map[string]interface {}{"name":"JOBSET_NAME"}
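For reviewers who want to check a container spec for this class of problem, here is a minimal sketch of detecting and removing repeated env var names. It is not xpk's actual code: the function names and the WORKER_ID entry are hypothetical, and it assumes the env list is a plain list of dicts with a "name" key, as in the error above.

```python
from collections import Counter


def find_duplicate_env_names(env):
    """Return env var names that appear more than once, in first-seen order."""
    counts = Counter(entry["name"] for entry in env)
    return [name for name, n in counts.items() if n > 1]


def dedupe_env(env):
    """Keep the first entry for each name and drop later repeats."""
    seen = set()
    result = []
    for entry in env:
        if entry["name"] not in seen:
            seen.add(entry["name"])
            result.append(entry)
    return result


# Hypothetical env list; only JOBSET_NAME comes from the error message.
env = [
    {"name": "JOBSET_NAME", "value": "xyz"},
    {"name": "WORKER_ID", "value": "0"},
    {"name": "JOBSET_NAME", "value": "xyz"},
]

print(find_duplicate_env_names(env))
print(len(dedupe_env(env)))
```

Keeping the first occurrence is an arbitrary choice for this sketch; the fix in this PR removes the duplicate definition at its source instead.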
Fixes / Features
-
Testing / Documentation
Testing details.
[ y/n ] Tests pass
[ y/n ] Appropriate changes to documentation are included in the PR