Open thejohnd0e opened 6 months ago
This is happening to me too on Windows with a 4090 when I try to use the OpenAI API. The WebUI itself is working fine. Tried LLama3-8B with exLlama and GGUF and neither worked properly.
Similar issue on WSL.
Same here. Generate in NoteBook and Default works fine.
ERROR Failed to build the chat prompt. The input is too long for the available context length.
Truncation length: 0
max_new_tokens: 512 (is it too high?)
Available context length: -512
Have the same problem with some linux based A40s and it's well into September.
Describe the bug
Failed to build the chat prompt.
Is there an existing issue for this?
Reproduction
Failed to build the chat prompt.
Screenshot
Logs
System Info