deepjavalibrary / djl-serving

A universal scalable machine learning model deployment solution
Apache License 2.0
189 stars 63 forks source link

[vLLM] support future arctic model FP8 quantization #1993

Closed lanking520 closed 2 months ago

lanking520 commented 3 months ago

Description

Add arctic model support