Open SinanAkkoyun opened 1 year ago
Does load_in_4bit need to be implemented for each model type? I get this error: TypeError: GPTBigCodeForCausalLM.__init__() got an unexpected keyword argument 'load_in_4bit'
I got this working. The issue is that the 4-bit integration hasn't been pulled into the accelerate or transformers releases on PyPI yet.
If you upgrade both to main (accelerate-0.20.0.dev0 and transformers-4.30.0.dev0), you will be good to go.
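For anyone hitting the same TypeError, here is a rough sketch of how to check the installed versions before trying the 4-bit load. The helper names (`release_tuple`, `supports_4bit`, `installed_versions`, `load_starcoder_4bit`) are made up for illustration, and the model id `bigcode/starcoderbase` is an assumption about which checkpoint you want. The load itself also assumes bitsandbytes is installed and a CUDA GPU is available; newer transformers releases may prefer passing a quantization config instead of the bare keyword.

```python
from importlib import metadata

# Minimum versions taken from the comment above (dev builds of main at the time).
# Installing from main is typically done with something like:
#   pip install git+https://github.com/huggingface/transformers.git
#   pip install git+https://github.com/huggingface/accelerate.git
MINIMUMS = {"accelerate": (0, 20, 0), "transformers": (4, 30, 0)}


def release_tuple(version: str) -> tuple:
    """Return the leading numeric segment, e.g. '4.30.0.dev0' -> (4, 30, 0)."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def supports_4bit(installed: dict) -> bool:
    """True if every package in MINIMUMS is present and new enough."""
    return all(
        release_tuple(installed.get(name, "0")) >= minimum
        for name, minimum in MINIMUMS.items()
    )


def installed_versions() -> dict:
    """Collect the installed versions of the packages we care about."""
    found = {}
    for name in MINIMUMS:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            pass  # missing package is treated as "too old" by supports_4bit
    return found


def load_starcoder_4bit(model_id: str = "bigcode/starcoderbase"):
    """Hypothetical helper: load a checkpoint with load_in_4bit=True.

    Assumes bitsandbytes and a CUDA GPU; model_id is an assumption.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, load_in_4bit=True, device_map="auto"
    )
    return tokenizer, model


if __name__ == "__main__":
    versions = installed_versions()
    print(versions, "4-bit ready:", supports_4bit(versions))
```

If the check reports that 4-bit is not ready, upgrading both packages to main as described above should make the `load_in_4bit` keyword reach `from_pretrained` instead of the model constructor.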
Hello, thank you for the awesome work!!!
What would I need to do/modify to make this work with StarCoder Base?
Thank you!