AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Ensure the synchronization of parameters using zero offload #3436
Closed
quic-huzh closed 2 weeks ago
Fixes #3435