vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Apache License 2.0
644 stars 52 forks source link

Update test #98

Closed dsikka closed 2 months ago

dsikka commented 2 months ago

SUMMARY: "please provide a brief summary"

TEST PLAN: "please outline how the changes were tested"