bigcode-project / starcoder.cpp

C++ implementation for 💫StarCoder
445 stars 36 forks source link

Starcoder conversion and quantization instructions #5

Closed aseok closed 1 year ago

aseok commented 1 year ago

Hi. Pls provide conversion and quantization instructions of the main starcoder model files.

NouamaneTazi commented 1 year ago

You can find the instructions here: https://github.com/bigcode-project/starcoder.cpp#quick-start

aseok commented 1 year ago

I have downloaded model files separately and skip downloading them in convert-hf-to-ggml.py. my problem is in quantization and probably running inference, how to pass the model files in quantization command? Should I rename them?

asingleoat commented 1 year ago

how to pass the model files in quantization command?

for the sharded model conversion don't pass the filenames, pass the directory: $ python convert-hf-to-ggml.py ./starcoder

then quantization is as in the README example: $ ./quantize starcoder-ggml.bin starcoder-ggml-q4_1.bin 3

NouamaneTazi commented 1 year ago

Does that fix your issue @aseok?