This is due to passing generators around for streaming events (ie, context within apu generated by tool calls). Likely this fix will primarily revolve around aggregating stacks when handling errors that bubble up from these generators.
Repro
execute llm request with bad model
Stack trace of reported error in server logs does not make sense
Success Criteria
intentional stack trance produced in the following scenarios...
This is due to passing generators around for streaming events (ie, context within apu generated by tool calls). Likely this fix will primarily revolve around aggregating stacks when handling errors that bubble up from these generators.
Repro
execute llm request with bad model Stack trace of reported error in server logs does not make sense
Success Criteria
intentional stack trance produced in the following scenarios...