Closed sarthfrey closed 4 years ago
cc @yuefengz can you help review this PR?
@yuefengz ping :)
@jhseu What is your recommended CI solution? This PR uses github workflow. Is it okay?
Ping @jhseu and @yuefengz again for CI choices.
For CI, GitHub workflow is fine for now, I think. We haven't setup an alternative for this repo.
Thanks @yuefengz! I addressed all the feedback. @jhseu are we good to merge?
Made a release here: https://pypi.org/project/spark-tensorflow-distributor/0.0.3/
@guptapriya @yuefengz @jhseu also please let me know who from TensorFlow I should add as an owner on the PyPi project :)
This PR aims to act as both a proposal and an initial version for a contribution of the spark-tensorflow-distributor python package. As mentioned in tensorflow#151 the general mandate of this package is to make it easier for users to do distributed training with TensorFlow 2 on their Spark clusters. Currently this package primarily acts as a job launcher for starting TensorFlow servers, configuring GPU and CPU resources for the user based on Spark resource scheduling so that they may easily run their deep learning workloads.
This PR also includes CI with GitHub workflows, which acts at the repository level by default. However, the CI is set up so that the checks will only be triggered by changes to this package's subdirectory in the ecosystem. This behavior is described in
.github/workflows/spark-tensorflow-distributor.yml
.I'd also like to publish this package to PyPi and am wondering if there's an ecosystem specific process for that.
Welcoming any and all feedback on this PR :)
cc @guptapriya @mengxr @husseinnagr-db