AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
2.15k
stars
383
forks
source link
Declare quantized lora layers only when peft library is present #3452
Closed
quic-kyunggeu closed 2 weeks ago