sql-machine-learning / elasticdl

Kubernetes-native Deep Learning Framework
https://elasticdl.org
MIT License
733 stars, 113 forks

create worker service and set TF_CONFIG env when needed #2462

Closed by skydoorkai 3 years ago

skydoorkai commented 3 years ago
  1. Add a need_tf_config argument.
  2. Create a service for each worker.
  3. Set TF_CONFIG for each ps/worker pod using the service domain names.
  4. Patch a worker service when its corresponding worker pod changes.
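
As a rough illustration of step 3, the sketch below builds a TF_CONFIG value from per-replica service domain names. It is not the code from this PR: the helper names (service_address, build_tf_config), the "<job>-<type>-<index>" service naming scheme, the "default" namespace, and port 2222 are all assumptions made for the example. Only the TF_CONFIG layout (a "cluster" map plus a "task" entry) follows TensorFlow's documented format.

```python
import json


def service_address(job_name, replica_type, index, namespace="default", port=2222):
    """Return a host:port string for one replica's Kubernetes Service.

    The naming scheme is hypothetical; a real job would use whatever
    service names the master actually creates.
    """
    return f"{job_name}-{replica_type}-{index}.{namespace}.svc:{port}"


def build_tf_config(job_name, num_ps, num_workers, task_type, task_index,
                    namespace="default", port=2222):
    """Build the JSON string to place in a pod's TF_CONFIG env variable."""
    cluster = {
        "ps": [
            service_address(job_name, "ps", i, namespace, port)
            for i in range(num_ps)
        ],
        "worker": [
            service_address(job_name, "worker", i, namespace, port)
            for i in range(num_workers)
        ],
    }
    # "task" tells this particular pod which cluster entry it is.
    tf_config = {
        "cluster": cluster,
        "task": {"type": task_type, "index": task_index},
    }
    return json.dumps(tf_config)


if __name__ == "__main__":
    # TF_CONFIG for the second worker of a job with 1 ps and 2 workers.
    print(build_tf_config("demo", 1, 2, "worker", 1))
```

Because the addresses are stable service domain names rather than pod IPs, the TF_CONFIG value would not need to change when a worker pod is replaced; under that assumption only the service has to be pointed at the new pod, which is what step 4 describes.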