Open k8si opened 4 days ago
I can't reproduce this. Could you trying passing https://huggingface.co/Mozilla/Meta-Llama-3.1-8B-Instruct-llamafile/resolve/main/Meta-Llama-3.1-8B-Instruct.Q5_K_M.llamafile as the --model
flag? It may be an issue with your GGUF file metadata.
Contact Details
ksilverstein@mozilla.com
What happened?
Summary: Using the llamafiler
/tokenize
endpoint does not seem to add special tokens when the corresponding flag is set to true, at least for llama-3.1-8b-instruct.Model/system info:
Command used to start llamafiler:
Curl to reproduce issue:
Output:
For comparison, here is a script to do the same thing in python using the HuggingFace transformers library directly:
Output:
Version
llamafile v0.8.16 llamafiler v0.8.16 (but actually I built from source at commit e5c0921)
What operating system are you seeing the problem on?
Mac
Relevant log output
No response