vllm-project / vllm

Release v0.5.5 #7481

Closed: simon-mo closed this 2 weeks ago

simon-mo commented 4 weeks ago

We will make a release later this week or early next week (Aug 16-Aug 19) to address the Gemma logit soft-capping bug and the OpenAI server metrics bug, and to include more performance enhancements.

Please add blockers if needed.
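
For context, Gemma 2 applies a tanh-based soft cap to its final logits, which is the step the fix above targets. Below is a minimal sketch of the operation itself, assuming Gemma 2's published final_logit_softcapping value of 30.0; the function name and example are illustrative, not vLLM's actual implementation:

```python
import torch

def soft_cap_logits(logits: torch.Tensor, cap: float = 30.0) -> torch.Tensor:
    """Tanh-based soft cap: squashes logits into (-cap, cap) while
    staying roughly linear near zero, so sampling stays well-behaved."""
    return cap * torch.tanh(logits / cap)

# Toy check: even exaggerated raw logits stay strictly inside the cap.
raw = torch.randn(2, 256_000) * 50.0  # (batch, vocab)-shaped dummy logits
capped = soft_cap_logits(raw)
assert capped.abs().max() < 30.0
```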

robertgshaw2-neuralmagic commented 4 weeks ago

Will focus on getting these over the line tomorrow and Thursday:

robertgshaw2-neuralmagic commented 3 weeks ago

Not required but nice + easy:

Jimmy-Newtron commented 2 weeks ago

... a release later this week or early next week (Aug 16-Aug 19) ...

When is the release now planned?

simon-mo commented 2 weeks ago

Now