neuralmagic / nm-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://nm-vllm.readthedocs.io
Other
251 stars 10 forks source link

bump version to 0.5.1 #330

Closed dhuangnm closed 4 months ago

dhuangnm commented 4 months ago

Failure is not relating to version change, merging.