Open kgilpin opened 4 weeks ago
In one of your examples, the API is failing because it's overloaded. I don't believe there's any way around this other than retrying, but that didn't seem to work in this case.
{"type":"error","error":{"details":null,"type":"overloaded_error","message":"Overloaded"} }
In other cases, it seems likely that it's the system prompt which is overflowing the token limit. The current implementation will only truncate user messages.
An example: https://github.com/getappmap/navie-benchmark/actions/runs/11522951430/job/32080178265#step:7:2114
Another example: https://github.com/getappmap/navie-benchmark/actions/runs/11522951430/job/32080178265
Search for
Context length exceeded. Reducing token count and retrying
(direct linking is not working for some reason).