Closed melodyliu1986 closed 1 month ago
You need to sign commit git commit -a --amend -s git push --force
You need to sign commit git commit -a --amend -s git push --force Done, please check again.
@MichaelClifford PTAL
Do you always need an HF_TOKEN? Can this still be used without it? If so can we make it optional in the podman run command? (I could be sadly mistaken).
Do you always need an HF_TOKEN? Can this still be used without it? If so can we make it optional in the podman run command? (I could be sadly mistaken).
So how to do that? copy the HF_TOKEN into the image by Containerfile?
No I am questioning whether this change forces users to always have and specify a token. I don't really know how this all works, but it seems that if an image is available without a token, this change will force users to specify a token even if one does not exist.
Made mistakes when sign-off, will fork a new branch to implement the changes.
I want to use the mistralai/Mistral-7B-Instruct-v0.2 models, and found there are no gguf files in HuggingFace, then I decided to use the ./convert_models functions to convert the model. I found there are some issues exist:
So I added the HF_TOKEN= parameter in the code.
Impacted files: README.md, download_huggingface.py, run.sh
If we go to https://github.com/ggerganov/llama.cpp.git, we can find the convert.py has been deprecated and moved to examples/convert_legacy_llama.py. I am not sure if I should just keep the line "python llama.cpp/convert-hf-to-gguf.py /opt/app-root/src/converter/converted_models/$hf_model_url", I just replace the convert.py with the correct path. also for llama.cpp/quantize
Impacted file: run.sh
So I added "localhost/converter" in the "podman run" command.
Here is my testing after the modification:
(the log is too long)