Open scikkk opened 9 months ago
I made 2 changes and the conversion succeeded:
# LINE 308: num_kv_heads = params["n_kv_heads"]
num_kv_heads = params["n_kv_heads"] if "n_kv_heads" in params else params["n_heads"]
# LINE 383: if model_size == "7B":
if model_size == "7B" and "layers.0.attention.inner_attention.rope.freqs" in loaded[0]:
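For anyone applying the same two changes, here is a minimal standalone sketch of what the guards do. The function names `resolve_num_kv_heads` and `takes_7b_rope_branch` and the sample dictionaries are hypothetical, for illustration only; the key names are the ones quoted above. It assumes `params` is the dict parsed from params.json and `loaded` is a list of checkpoint state dicts, as the two edited lines suggest.

```python
# Key name copied from the edited line 383 above.
ROPE_FREQS_KEY = "layers.0.attention.inner_attention.rope.freqs"


def resolve_num_kv_heads(params: dict) -> int:
    """Fall back to n_heads when params.json has no n_kv_heads entry."""
    return params["n_kv_heads"] if "n_kv_heads" in params else params["n_heads"]


def takes_7b_rope_branch(model_size: str, loaded: list) -> bool:
    """Only enter the 7B branch when the first shard actually has the rope key."""
    return model_size == "7B" and ROPE_FREQS_KEY in loaded[0]


if __name__ == "__main__":
    # Hypothetical params without n_kv_heads: the fallback returns n_heads.
    print(resolve_num_kv_heads({"n_heads": 32}))
    # Hypothetical checkpoint without the rope key: the branch is skipped.
    print(takes_7b_rope_branch("7B", [{"tok_embeddings.weight": None}]))
```

The first guard avoids a `KeyError` when the key is absent, and the second skips the 7B-specific branch when the checkpoint does not contain the rope frequencies.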
I'm stuck at the codellama/7B/params.json file not being JSON. It's an HTML file?
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
When I view the html file in a browser, it's a "Sign in to continue to Gmail" login page.
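A quick way to confirm this kind of failure before running the conversion is to check whether the downloaded file looks like HTML rather than JSON. This is only a sketch: `load_params` is a hypothetical helper, not part of the conversion script, and the check for a leading `<!doctype` or `<html` is a heuristic assumption.

```python
import json
from pathlib import Path


def load_params(path):
    """Parse params.json, with a clearer error if the file is an HTML page."""
    text = Path(path).read_text(encoding="utf-8")
    # A login or error page saved in place of the real file starts with markup.
    if text.lstrip().lower().startswith(("<!doctype", "<html")):
        raise ValueError(
            f"{path} looks like an HTML page, not JSON; "
            "the download probably returned a login or error page."
        )
    return json.loads(text)
```

If this raises, the file most likely needs to be downloaded again from a source you are authorized for; the conversion script itself is not at fault.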
Hello! Thanks for your great work, but I ran into some problems when trying to replicate the results.
Specifically, I cannot find convert_raw_llama_weights_to_hf.py, which README.md refers to.
However, I found convert_raw_llama_weights_to_neox.py, which seems to convert from the Meta format to the NeoX format.
But the Python script doesn't support --config_file, so I used --model_size=7B instead. Unfortunately, I got an error:
Here is my convert.sh:
Here is my raw codellama7b:
Looking forward to your reply, any help will be appreciated!