neuralmagic / nm-vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://nm-vllm.readthedocs.io

use v1.0.0 tag for nm-actions #367

Closed. dhuangnm closed this 4 months ago

dhuangnm commented 4 months ago

The failure doesn't look related to the changes in this PR; landing.

dhuangnm commented 4 months ago

Reran the failed job and it passed this time.