Description:
Hi there,
I noticed that opt.py in this repository provides a way to save quantized weights, but bloom.py does not. Would it be possible to add this to bloom.py as well?
Saving quantized weights is useful for reducing the size of stored models, and it would be great to have this available in every relevant script in the repository.
Adding it to bloom.py would help anyone working with that file.
Thank you for your time and consideration.
Best regards,