Open jeff52415 opened 1 year ago
@jeff52415 I managed to do something similar in my fork at https://github.com/vihangd/alpaca-qlora. I am also trying to add support for more models.
Hi @vihangd, I'd like to try your fork, but why did you remove export_hf_checkpoint.py?
@jeff52415 Thanks for the PR, I'll try it.
@kocoten1992 I am working on adding it back. I just need to ensure it works with GPT-NeoX models as well.
@kocoten1992 The fork now includes export_hf_checkpoint.py that works with any model.
The current system does not support 4-bit training or inference. Since this looks relatively easy to implement, I am willing to help integrate the feature.