Open richiejp opened 3 months ago
I have two patches for DeepSpeed. One allows MIG to work and the other just adds a health check. The MIG one looks difficult to get upstreamed. The health check could be upstreamed easily if they accept it, or dropped in favor of just checking the gRPC backend.
There's actually no suitable DeepSpeed-MII container AFAIK, so I'm moving closer to the idea of 1. I would probably include the Docker image inside the operator repo, though, because there is other stuff inside the container repo and it would be nice to have at least the open source stuff in a mono-repo.
Also, there's been no response on the PR. I'm moving this to the post-release milestone and will just make the container public.
Actually, I may just bring the containers repo into this one.
We could: