Closed charbull closed 8 months ago
Hi, in general you shouldn't use the --weights parameter at this time.
In the future you can use it to load fine tuned weights and make compressed versions, but that requires a python script to convert weights https://github.com/google/gemma.cpp/issues/11
Instead, only use the compressed weights (you don't need the keras weights, just the sfp files from the GemmaCpp download page).
Second, you probably want to start with the -it "instruction tuned" models which are more appropriate for interactive use. the -pt "pretrained models" are more of a starting point for fine tuning. So in summary:
2b-it-sfp.sbs
./gemma \
--tokenizer vocabulary.spm \
--compressed_weights 2b-it-sfp.sbs --model 2b-it
and see if that works.
Closing for now but if you still run into an issue we'll reopen and help.
Thanks ! that worked, I was using the keras weights.
Hi,
I am experiencing the follow issue, I tried the following versions: https://www.kaggle.com/models/keras/gemma/frameworks/Keras/variations/gemma_2b_en/versions/1 https://www.kaggle.com/models/keras/gemma/frameworks/Keras/variations/gemma_2b_en/versions/2
Any ideas how to resolve?
Cheers, Charbel