allyourbot / hostedgpt

An open version of ChatGPT you can host anywhere or run locally.
MIT License
192 stars 75 forks source link

Voice mode v1 #311

Closed krschacht closed 4 days ago

krschacht commented 2 weeks ago

This is a large PR. Fundamentally, it introduces the first version of a voice interaction feature. It should be working pretty well however there is not full test coverage so the feature is initially disabled. It can be enabled in features.yml

The core functionality is: turn on the mic, speak aloud instead of typing, responses are spoken back, any time you say "see this" or "my screen" or other references which indicate you're looking at something it will take a screenshot, if you say "hold on Samantha" then it will mostly stop listening but you can reinvoke it by saying something like "Samantha, I'm back." Here is a short demo video:

https://share.zight.com/L1ubg6j4

Included in this PR:

There is a fair amount of plumbing for blocks: