Are there some resources that explain how the quantized parameters are structured in a GGUF file?
We are interested in porting HQQ-quantized models into GGUF format, but in order to do that, we need to know exactly how it is stored.
We basically need to know:
Hello!
Are there some resources that explain how the quantized parameters are structured in a GGUF file? We are interested in porting HQQ-quantized models into GGUF format, but in order to do that, we need to know exactly how it is stored. We basically need to know:
Thanks!