Open Lin-Qingyang-Alec opened 8 months ago
same here
@monuminu From your homepage, it appears that you work at Microsoft. Don’t you know where this part of the code can be obtained?😂
Let me get the info :)
@monuminu Ok, please keep me posted if there are any updates.
The code will be released in a few days.
Thank you, I am looking forward to the day when the code is available.
According to the blog, this seems to be implemented in, or part of, vLLM:
"Our approach is now part of vLLM and can also be implemented with other frameworks."
@kd303 I have already read this blog. But do you know how to use it in vLLM? I cannot find any related keyword in the vLLM code.
Thanks for the interest in Splitwise code. We have opened https://github.com/vllm-project/vllm/issues/2472 to track the open sourcing of our internal prototype.
Microsoft has claimed that "Splitwise" is supported in vLLM, see https://www.microsoft.com/en-us/research/blog/splitwise-improves-gpu-usage-by-splitting-llm-inference-phases/
So how do I use it in vLLM? I could not find any keyword related to "Splitwise".