Closed steveshaoucsb closed 11 months ago
try increasing the context size to 2048
try increasing the context size to 2048
Thanks! Now that error disappeared, but the new problem is that the llama 2 doesn't finished the answer and apparently aborted at some point. Here are examples:
That's weird. The dialog is terminated without an error only if llama generates an end-of-session token. If you continue the conversation, will it continue normally or with an error?
It just continue without showing obvious error message, other than the conversation ends early.
Tell me, did the update solve the problem?
Just finished the update. The problem apparently still exists as llama2 still ends its reply early.
Can you send me the text of the request? I'll try it myself.
Yes. Here is my prompt:
Give me a Seoul travel plan
I think I found what the problem was. Tell me, did the update solve the problem?
I think I found what the problem was. Tell me, did the update solve the problem?
It fixed!
When I ask llama 2 after the it successfully generated an answer, or when it is in the process of generating long answers, it will always shows up “Eval error” and I need to restart the app to let the llama 2 work again.