StampyAI / stampy-chat

Conversational chatbot to answer questions about AI Safety & Alignment based on information retrieved from the Alignment Research Dataset
https://chat.stampy.ai
MIT License

Refresh after full generation to inject content and re-render #76

Closed markovial closed 1 year ago

markovial commented 1 year ago

The current chat renders token by token, but if we are injecting glossary entries and also rendering equations and the like, then we probably want to do that once the entire text has been generated.

So let the thing generate as normal for now, then do a quick flash at the end which puts everything through a parser that renders/injects everything that needs to be injected, and then re-displays the entire text.
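A minimal sketch of this proposal, with hypothetical stand-in processors (the names `injectGlossary`, `renderEquations`, and the glossary term "corrigibility" are assumptions for illustration; the real processors would emit DOM nodes or KaTeX output):

```typescript
// Stream raw tokens as-is during generation, then run one full-text
// pass when the stream finishes and swap in the processed result.

type Processor = (text: string) => string;

// Stand-in processors (hypothetical); real ones would render HTML/math.
const injectGlossary: Processor = (t) =>
  t.replace(/\bcorrigibility\b/g, '<a href="#corrigibility">corrigibility</a>');
const renderEquations: Processor = (t) =>
  t.replace(/\$([^$]+)\$/g, (_m, eq) => `<span class="math">${eq}</span>`);

class FinalPassRenderer {
  private raw = "";
  constructor(private processors: Processor[]) {}

  // During generation: append the token and display the raw text so far.
  onToken(token: string): string {
    this.raw += token;
    return this.raw; // shown verbatim while streaming
  }

  // After generation: one pass over the full text, then re-display.
  onDone(): string {
    return this.processors.reduce((t, p) => p(t), this.raw);
  }
}
```

The upside is that every processor sees the complete text, so nothing can be split across chunk boundaries; the downside is the visible re-render at the end.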

FraserLee commented 1 year ago

As the person who's written the front-end, it's actually not a problem whatsoever to process stuff as we receive it. It gives a far better user experience than waiting (doubly true if we ever make that swap to gpt-4), and even the one-step-beyond-trivial implementation we've got right now renders absolutely instantly for me. Should that ever not be the case, there are a lot of low-hanging-fruit optimizations that I suspect could bring rendering performance up by at least an order of magnitude.

It's true - there exist some types of text processing that would need global information, but everything on the roadmap can totally make do either completely chunk by chunk, or with some very minor look-ahead buffer. Even if we add some insane feature down the road, we could reprocess just for that and keep the good UX for everything else.
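The look-ahead-buffer idea can be sketched as follows. This is a hypothetical illustration, not the repo's actual code: the glossary entry, the `StreamProcessor` class, and the bracket-wrapping `linkTerms` are all assumptions. The key point is that only a bounded tail of the stream is held back, so everything else renders as it arrives:

```typescript
// Process streamed chunks incrementally, holding back a small tail so a
// glossary term split across two chunks can still be matched later.

const GLOSSARY: Record<string, string> = {
  "mesa-optimizer": "an optimizer produced by another optimizer", // assumed entry
};

// The longest term bounds how much look-ahead we can ever need.
const MAX_TERM = Math.max(...Object.keys(GLOSSARY).map((t) => t.length));

// Stand-in for real rendering: wrap known terms in brackets.
function linkTerms(text: string): string {
  let out = text;
  for (const term of Object.keys(GLOSSARY)) {
    out = out.split(term).join(`[${term}]`);
  }
  return out;
}

class StreamProcessor {
  private tail = ""; // unprocessed look-ahead buffer

  // Returns the newly safe-to-render portion of the stream.
  push(chunk: string): string {
    const text = this.tail + chunk;
    // Candidate cut: far enough from the end that no partially-arrived
    // term can still be forming before it.
    let cut = Math.max(0, text.length - (MAX_TERM - 1));
    // Pull the cut back before any complete match that straddles it,
    // repeating until stable.
    let changed = true;
    while (changed) {
      changed = false;
      for (const term of Object.keys(GLOSSARY)) {
        for (let i = text.indexOf(term); i !== -1; i = text.indexOf(term, i + 1)) {
          if (i < cut && i + term.length > cut) {
            cut = i;
            changed = true;
          }
        }
      }
    }
    this.tail = text.slice(cut);
    return linkTerms(text.slice(0, cut));
  }

  // Flush and process whatever remains when the stream ends.
  finish(): string {
    const rest = linkTerms(this.tail);
    this.tail = "";
    return rest;
  }
}
```

Because the buffer never exceeds roughly the length of the longest glossary term, the user sees text almost as fast as it streams in, with no end-of-generation re-render.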

If you're not satisfied with this answer, I would love to talk through the code with anyone interested :)