Support GPUs on multiple machines (via docker-swarm or kubernetes)?

ml-tooling / ml-hub

🧰 Multi-user development platform for machine learning teams. Simple to setup within minutes.

Apache License 2.0

301 stars 64 forks source link

Feature description:

Support docker-swarm (with GPUs support) out-of-the-box.

Problem and motivation:

As here describes, CURRENTLY it is not possible to run ml-hub with GPU support across multiple machines (while every machine may have one or more GPU cards). Since it is not easy to build a kubernetes cluster with GPU support and management (and I'm not farmiliar with kubernetes), maybe a more lightweight solution (like docker-swarm?) would support it more seamlessly (via nvidia-docker).

Is this something you're interested in working on?

Yes

ml-tooling / ml-hub

Support GPUs on multiple machines (via docker-swarm or kubernetes)? #19