Why no save? - Githubissues

Vahe1994 / SpQR

Apache License 2.0

515 stars 40 forks

yhyu13 closed 1 year ago

yhyu13 commented 1 year ago

Hi, thanks for sharing this great quantization technique.

But I am not sure I understand why saving is not supported at the moment. In main.py:

    if args.save or args.save_safetensors:
        raise NotImplementedError()

What obstacles prevent compressed models from being saved and reloaded?

Do you know of any techniques that allow dumping the model from VRAM to disk and reloading it directly?
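
To make the question concrete, here is a minimal sketch of what saving and reloading a compressed tensor involves in general. It is not SpQR's format or code: the toy 4-bit min-max quantizer, the file layout, and every function name below (`quantize_4bit`, `save_quantized`, `load_quantized`, `dequantize`) are assumptions made up for illustration. The point is only that the quantized codes and their statistics (scale, zero point) have to be serialized together so the weights can be reconstructed on load.

```python
import json
import os
import struct
import tempfile


def quantize_4bit(weights):
    """Toy min-max quantizer (hypothetical, not SpQR): floats -> codes in 0..15."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 15 or 1.0  # fall back to 1.0 for constant input
    codes = [min(15, max(0, round((w - lo) / scale))) for w in weights]
    return codes, scale, lo


def save_quantized(path, codes, scale, zero):
    """Write a length-prefixed JSON header, then two 4-bit codes per byte."""
    header = json.dumps({"n": len(codes), "scale": scale, "zero": zero}).encode()
    padded = codes + [0] * (len(codes) % 2)  # pad to an even number of codes
    packed = bytes(padded[i] | (padded[i + 1] << 4) for i in range(0, len(padded), 2))
    with open(path, "wb") as f:
        f.write(struct.pack("<I", len(header)))
        f.write(header)
        f.write(packed)


def load_quantized(path):
    """Read the header back, unpack the codes, and drop any padding."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<I", f.read(4))
        meta = json.loads(f.read(header_len))
        packed = f.read()
    codes = []
    for byte in packed:
        codes.extend((byte & 0x0F, byte >> 4))
    return codes[: meta["n"]], meta["scale"], meta["zero"]


def dequantize(codes, scale, zero):
    """Reconstruct approximate floats from codes and quantization statistics."""
    return [c * scale + zero for c in codes]


if __name__ == "__main__":
    weights = [-1.0, -0.5, 0.0, 0.25, 1.0]
    codes, scale, zero = quantize_4bit(weights)
    path = os.path.join(tempfile.mkdtemp(), "layer.q4")
    save_quantized(path, codes, scale, zero)
    print(dequantize(*load_quantized(path)))
```

A real compressed checkpoint would need to store more than this (per-group statistics, outliers, tensor shapes, and so on), which is presumably part of why a save path takes extra work.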

qwopqwop200 commented 1 year ago

yhyu13 commented 1 year ago

Sure, gonna stay tuned!