AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
2.15k
stars
383
forks
source link
MMP: Propagate user requests upstream, apply requests to quantsim #3466
Closed
quic-ashvkuma closed 1 week ago