Closed ChenMnZ closed 7 months ago
OmniQuant is a PTQ method that supports both weight-only quantization and weight-activation quantization.
OmniQuant is a PTQ method that supports both weight-only quantization and weight-activation quantization.