quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
https://quic.github.io/aimet-pages/index.html
License: Other
2.15k stars, 383 forks

MMP: Propagate user requests upstream, apply requests to quantsim #3466

Closed: quic-ashvkuma closed this 1 week ago