In the web interfaces for ChatGPT and (especially) Claude chats, the streaming feels super smooth, i.e. responses are pieced together in components even finer than tokens. It feels more "chunky" in pal. This may just be a function of responses from pal tending to be shorter than the few paragraphs typical of an unprompted ChatGPT or Claude, and thus being comprised of fewer total tokens, but I do think a really satisfying interface would make it feel like you're actually watching the model "type."
In the web interfaces for ChatGPT and (especially) Claude chats, the streaming feels super smooth, i.e. responses are pieced together in components even finer than tokens. It feels more "chunky" in pal. This may just be a function of responses from pal tending to be shorter than the few paragraphs typical of an unprompted ChatGPT or Claude, and thus being comprised of fewer total tokens, but I do think a really satisfying interface would make it feel like you're actually watching the model "type."