Vahe1994 / SpQR

Apache License 2.0

Why no save? #11

Closed yhyu13 closed 1 year ago

yhyu13 commented 1 year ago

Hi, thanks for sharing this great quantization technique.

But I am not sure I understand why saving is not supported at the moment. In main.py:

    if args.save or args.save_safetensors:
        raise NotImplementedError()

What are the obstacles that prevent compressed models from being saved and reloaded?

Do you know of any technique for dumping the model from VRAM to disk and reloading it from disk directly?

qwopqwop200 commented 1 year ago

https://github.com/Vahe1994/SpQR/issues/1

yhyu13 commented 1 year ago

Sure, gonna stay tuned!