Closed chaelli closed 4 weeks ago
hi @chaelli there's already a PR in progress for streaming responses here https://github.com/microsoft/kernel-memory/pull/400
@dluc I know - that's why I made it "draft" - didn't know this will also list it here. I just wanted to quickly show the "simple" way. Will close this. Also added a comment on the discussion #625 just hope we can get some streaming solution - for direct usage in UIs this is very important
Motivation and Context (Why the change? What's the scenario?)
High level description (Approach, Design)