Open azazar opened 2 months ago
I've found this too, but I think it's related to making the chat pretty when streaming. Turning off streaming or pretty seems to return performance.
Have you tried either of those?
Edit: ah just saw the stack trace, yep, seems to be a similar issue to what I experience.
It worked. Turning off streaming and pretty output helps.
I'm going to close this issue for now, but feel free to add a comment here and I will re-open or file a new issue any time.
I actually like streaming and pretty output.
I would like to request you reopen the item because it isn't fixed. Long responses that refactor several files take longer and longer as you go due to the O(N^2) algorithm for formatting the markdown. I'm seeing 1 token per second from claude sonnet.
The main problem with using the --no-pretty workaround is that you lose the ability to use previous commands by pressing the up-arrow. It just outputs some escape commands instead, at least on Ubuntu.
architect> ^[[A^[[A
Aider is creating high CPU load when dealing with large LLM responses and long patches. And it takes very long time to process a response from the LLM.