Open flozi00 opened 1 week ago
Higher throughput und memory savings are always cool 😎
I think that could be integrated very easily, what do you think about it's design ?
https://github.com/PygmalionAI/aphrodite-engine/commit/73177656ed75ec880a409640ef2b9a8043cf96a8
No response
https://github.com/vllm-project/vllm/pull/8751
Motivation.
Higher throughput und memory savings are always cool 😎
I think that could be integrated very easily, what do you think about it's design ?
Proposed Change.
https://github.com/PygmalionAI/aphrodite-engine/commit/73177656ed75ec880a409640ef2b9a8043cf96a8
Feedback Period.
No response
CC List.
No response
Any Other Things.
No response
Before submitting a new issue...