microsoft / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Apache License 2.0
1.76k stars 163 forks source link

Any plans for produnction-ready services? #454

Open SeungminHeo opened 3 months ago

SeungminHeo commented 3 months ago

Hello :) With this wonderful performance project, we tested and got tremendous improvement on our model inference service.

One thing that I wanna know is, Deepspeed-fastgen has any plans to be improved as production-ready serving framework?

For instance, I think some features have to meet expectations.

Maybe, some other features can be added. If you have in mind to improve this project, some contribution can be made but roadmap has to be made.

I want to hear opinion of you. Thank you.