adding custom arithmetic

Is your feature request related to a problem? Please describe. There is a C++ library https://github.com/stillwater-sc/universal for mixed-precision algorithm development and optimization that has tens of thousands of arithmetic types that could be leveraged in quantization. From custom floating-points and fixed-points, to tapered floating points in posits and takums, to logarithmic and double base systems.

How would we go about integrating that capability into the llm-compressor

Describe the solution you'd like architecture evaluation to make certain that this is reasonable engineering effort that would be win-win for both environments

Describe alternatives you've considered We have directly integrated into PyTorch, but that kept bit rotting due to the rapid change of PyTorch. We currently do everything through Intel's floating-point compressor library.

Additional context mixed-precision algorithms have been very valuable in the HPC and DSP verticals and are being rediscovered in the new AI space. There is a wealth of knowledge in the HPC and DSP space about custom arithmetic that could rapidly be applied to AI model quantization.

vllm-project / llm-compressor

adding custom arithmetic #41