ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
https://ollama.com
MIT License

Models sometimes prompt themselves #549

Closed txstc55 closed 1 year ago

txstc55 commented 1 year ago

I'm using an uncensored model; the issue happened with uncensored:latest, uncensored 70b, and every other uncensored model I tried. Sometimes when I prompt the model, after it makes a response it will prompt itself with something like:

```
### Input:
something generated by the model itself

### Response:
something that is a response to the input
```

This happens randomly, and sometimes the `### Input:` tag becomes a `### human` tag. Any idea why this happens?

BruceMacD commented 1 year ago

It looks like there could be stop words missing from the default llama2-uncensored Modelfile; these tell the LLM when to stop generating more text.
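Conceptually, a stop word just cuts generation off at the first occurrence of any of the configured sequences. Here is a minimal Python sketch of that idea (an illustration only, not Ollama's actual implementation — the function name and example text are made up):

```python
# The stop sequences from the workaround below.
STOP_SEQUENCES = ["### Input:", "### Response:", "### human"]

def truncate_at_stop(text: str, stops=STOP_SEQUENCES) -> str:
    """Return text up to (not including) the earliest stop sequence.

    This mimics what a stop parameter does: as soon as any stop
    sequence shows up in the output, everything from that point on
    (the model's self-prompt) is discarded.
    """
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]

reply = "Sure, here is the answer.\n### Input:\nself-prompted question"
print(truncate_at_stop(reply))  # the self-prompt after "### Input:" is dropped
```

Without the stop sequences configured, the model keeps sampling past its own answer and produces the self-prompting you're seeing.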

As a workaround until this gets fixed, you can create your own llama2-uncensored Modelfile with the correct stop words. Here is how to do that:

1. Create the Modelfile:

```
FROM llama2-uncensored:latest
TEMPLATE """### HUMAN:
{{ .Prompt }}

### RESPONSE:

"""
PARAMETER stop "### Input:"
PARAMETER stop "### Response:"
PARAMETER stop "### human"
```


2. Load the custom model into Ollama via the CLI:

```
$ ollama create llama2-uncensored:custom -f path/to/Modelfile
```


3. Now you can run it, and generation should stop when one of the stop patterns is detected:

```
$ ollama run llama2-uncensored:custom
>>> hello
Hello back!
```
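If you'd rather not bake the stop words into a custom Modelfile, the same stop sequences can also be supplied per-request through the REST API's `options` field. A sketch (assumes an Ollama server running at the default `localhost:11434`; the `generate` helper is just illustrative):

```python
import json
import urllib.request

# Per-request options: the stop sequences override/augment the Modelfile.
payload = {
    "model": "llama2-uncensored:latest",
    "prompt": "hello",
    "stream": False,
    "options": {"stop": ["### Input:", "### Response:", "### human"]},
}

def generate(payload, url="http://localhost:11434/api/generate"):
    """POST the request to a local Ollama server and return the response text."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

This is handy for testing which stop words actually fix the self-prompting before committing them to a Modelfile.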