neuralmagic / sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Apache License 2.0
2.07k stars 148 forks source link

[Feature Branch] Quant modifier UX #2263

Closed rahul-tuli closed 6 months ago

rahul-tuli commented 6 months ago

Quantization Modifier UX Update

Description

This PR refactors the quantization modifiers to enhance user experience and simplify the system architecture. It is based off of changes from ~the sa/quant_mod_refactor~ main branch, all subsequent changes will be merged as smaller bites sized PRs into this. Key updates include:

Reference Documentation

For more detailed information about the changes and their impact, please refer to the documentation here.