grencez closed this 1 year ago
This doesn't really make sense, especially given how we're doing the instruction-following format (https://github.com/rendezqueue/rendezllama/issues/5#issuecomment-1528479960).
Re-opening as "keep EOS delimiter token", and supporting it in the Alpaca format.
So I guess we want to keep the EOS intact all the time. And insert it when appropriate.
We don't insert an EOS when editing or reading in a rolling prompt... not sure if I want to bother.
However, we still need to document the ((sentence_terminals) "\n") option. It is documented in the assistant_alpaca example, but nowhere else.
When llama.cpp's main example runs in "instruct" mode, an end-of-sentence (EOS) token is placed after each chatbot response. I'm not sure if this is only useful for Alpaca models, but it seems worth trying.
Let's switch into this mode when newline is an end-of-sentence token.
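A rough sketch of the rule proposed above, in case it helps pin down the behavior: keep EOS after each chatbot response only when "\n" is one of the sentence terminals. This is not rendezllama's actual code. The names `EOS_TOKEN_ID`, `keeps_eos`, and `finish_response` are hypothetical, and the token id is a placeholder, since the real value depends on the model's vocabulary.

```python
from typing import List, Set

# Placeholder id (assumption); a real model defines its own EOS token id.
EOS_TOKEN_ID = 2


def keeps_eos(sentence_terminals: Set[str]) -> bool:
    """Return True when newline is configured as a sentence terminal."""
    return "\n" in sentence_terminals


def finish_response(response_tokens: List[int],
                    sentence_terminals: Set[str]) -> List[int]:
    """Return a copy of the response, ending in EOS when the mode is active."""
    tokens = list(response_tokens)
    already_terminated = bool(tokens) and tokens[-1] == EOS_TOKEN_ID
    if keeps_eos(sentence_terminals) and not already_terminated:
        tokens.append(EOS_TOKEN_ID)
    return tokens
```

With "\n" among the terminals, a response gains a trailing EOS unless it already ends with one. With any other terminal set, the tokens come back unchanged. Editing and rolling-prompt input are left out on purpose, matching the "not sure if I want to bother" note above.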